
PATENT 



CO 

CD 



Examiner 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Botstein, et al. 

Listed Below 

Listed Below 

SECRETED AND 
TRANSMEMBRANE 
POLYPETIDES AND 
NUCLEIC ACIDS ENCODING 
THE SAME 

UNKNOWN 



ASSOCIATE POWER OF ATTORNEY 



Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 

Dear Sir: 

The registrants of Knobbe, Martens, Olson & Bear, LLP, 2040 Main Street, Fourteenth 
Floor, Irvine, CA 92614, Telephone No.: (949) 760-0404, Customer No. 30,313 are hereby 
appointed an associate attorney to prosecute the applications listed below and take all actions 
respecting the same. 

Please address all future communications relative to the pending patent applications listed 
below to the following address: 



AnneMarie Kaiser 
KNOBBE, MARTENS, OLSON & BEAR, LLP 
2040 Main Street 
Fourteenth Floor 
Irvine, CA 92614 
(949) 760-0404 



BEST AVAILABLE COPY -1- 



Appl. No. 
Filed 



Listed Below 
Listed Below 



Appln. 


AtjdIii. Date 


Docket No. 


No. 






09/866034 


5/25/2001 


GNE.2930R1C1 


10/033167 


12/27/2001 


GNE.2930R1C10 


10/692072 


10/22/2003 


GNE.2930R1C10C1 


10/213535 


8/7/2002 


GNE.2930R1C13 


10/033244 


12/27/2001 


GNE.2930R1C2 


10/032996 


12/27/2001 


GNE.2930R1C3 


10/033396 


12/27/2001 


GNE.2930R1C4 


10/033223 


12/27/2001 


GNE.2930R1C9 



Please use Customer No. 30,313 for all communications. 

Respectfully submitted, 



GENENTECH, INC. 





Dated: ll-Ql-^LQO^ By: CM^yJUc^ 7^V^ LU<e^ 

Elizabeth^. Barnes, Ph.D., J.D. 
Registration No. 35,059 

Title: Patent Counsel 

Address: 1 DNA Way 

South San Francisco, CA 94080 



S:\DOCS\BSO\BSG-1 563.D0C 
102504 



OXFORD UNIVERSITY PRESS 
Oxford New York Tokyo 
1994 



1196 i CHAPTER THIRTY-NINE 



~10x). Also, some tumor lines have amplified ras 
genes. A 20-fold increase in the level of a nontrans- 
forming Ras protein is sufficient to allow the 
transformation of some cells. The effect has not 
been fully quantilaled, but it suggests the general 
conclusion that oncogenesis depends on over- 
activity of Ras protein, and is caused either by 
increasing the amount of protein or (probably more 
efficiently) by a variety of mutations that increase 
the activity of the protein. 

Transaction by DNA can be used to transform 
only certain cell types. Although transforming 
oncogenes have been isolated from both rodent and 



human cells, most targets for transformation by 
transfection with oncogenes have been rodent 
fibroblasts in culture. (In fact, the difference in the 
source of the oncogene [human] and the recipient 
cell [rodent] is an important factor in allowing 
the donor gene to be distinguished unequivocally 
from recipient DNA.) Limitations of the assay 
explain why relatively few oncogenes have 
been detected by transfection. This system has 
been most effective with ras genes, where there 
is extensive correlation between mutations that 
activate c-ras genes in transfection and the 
occurrence of tumors. 



Insertion, translocation, orja^^^^^^s^'^W^^Z^> . 



A variety of genomic changes can activate proto- 
oncogenes, sometimes involving a change in the 
target gene itself, sometimes activating it without 
changing the protein product. In cases of insertion 
and translocation, there is evidence that the 
genomic change is the causative event; in cases of 
amplification there is a correlation with tumori- 
genesis, but no direct proof for a causative role. 

Many tumor cell lines have visible regions of 
chromosomal amplification, as shown by homo- 
geneously staining regions (see Figure 56.27) or 
double minute chromosomes (see Figure 56.28). In 
some cases, the amplified region contains a known 
oncogene or a gene related to one. In other cases, 
where amplification is not visible, the use of 
batteries of probes representing oncogenes shows 
that a particular oncogene is amplified. Examples 
of oncogenes that are amplified in various tumors 
include c~myc, c-abl, c-myb, c-erbB, and c-K-ras. 

Established cell lines are prone to amplify genes 
(it is one of several karyotypic changes to which 
they are susceptible). .All the same, the presence of 
known oncogenes in the amplified regions, and the 
consistent amplification of particular oncogenes in 
many independent tumors of the same type, again 



strengthens the correlation between increased 
expression and tumor growth. Of course, it is 
possible that the gene amplification gives an 
advantage to growth of the established tumor; it is 
not necessarily an event involved in its initiation. 

Some proto-oncogenes are activated by events 
that change their expression, but which leave their 
coding sequence unaltered. The best characterized 
is c-myc, whose expression is elevated by several 
mechanisms. One common mechanism is the 
insertion of a nondefective retrovirus in the vicinity 
of the gene. 

The ability of a retrovirus to transform without 
expressing a v-onc sequence was first noted during 
analysis of the bursal lymphomas caused by the 
transformation of B lymphocytes with avian virus. 
Similar events occur in the induction of T cell 
lymphomas by murine leukemia virus. In each case, 
the transforming potential of the retrovirus seems to 
lie with its LTR rather than with a coding sequence. 

In many independent tumors, the virus has 
integrated into the cellular genome within or close 
to the c-myc gene. The gene consists of three 
exons; the first represents a long nontranslated 
leader, and the second two code for the c ; Myc 



OiNCOGENES: GENE EXPRESSION AND CANCER I 1197 



protein. Figure 59.8 summarizes the types of in- 
serlion at tliis locus. The retrovirus may be inserted 
at a variety of locations relative to the c-myc gene. • 

The simplest insertions to explain are those that 
occur within the first intron. The LTR provides a 
promoter, and transcription reads through the two 
coding exons. Transcription of c-myc under viral 
control differs in two ways from its usual expres- 
sion: the level of expression is increased (because 
the LTR provides an efficient promoter); and the 
transcript lacks its usual nonlranslaled leader 
(which may usually limit expression). 

Activation of c-myc in the other two classes of 
insertions reflects different mechanisms. The 
retroviral genome may be inserted within or 
upstream of the first intron, but in reverse orienta- 
tion, so that its promoter points in the wrong 
direction. Probably the LTR provides an enhancer 
that acts on an upstream sequence that fortuitously 
resembles a promoter. The retroviral genome also 
may be inserted downstream of the c-myc gene, in 
which case transcription initiates at the usual c-myc 
promoter(s) , but is increased by the enliancer in the 
retroviral LTR. 

In all of these cases, the coding sequence of c-myc 
is unchanged, so oncogenicity is attributed to the loss 
of normal control, and increased expression, of the 
gene. 

Other oncogenes that are activated in tumors by 
\ the insertion of a retroviral genome include c-erbB s 
I c-myb, c-mos, c-H-ras, and c-raf Up to 10 other eel- 
j. hilar genes (not previously identified as oncogenes 
jrby their presence in transforming viruses) are 
£ implicated as potential oncogenes by this criterion. 
|The best characterized among this latter class are 
fMmil and m/2. The wntl gene codes for a protein 
ginvolved in early embryogenesis that is related to 
| the wingless gene of Drosophila; int2 codes for a 
jtgrowth factor related to FGF. 
fr Translocation to a new chromosomal location is 
| another of the mechanisms by which oncogenes are 
pictivaled. Certain chromosomal translocations are 
^consistently associated with activation of onco- 
penes that lie near the breakpoints. This situation 
|3vas originally discovered via a connection 
between the loci coding immunoglobulins and the 



Figure 39.8 
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occurrence of certain tumors. Specific chromo- 
somal translocations are often associated with 
plasmacytomas in the mouse and with Burkitt 
lymphomas in man. These tumors arise from 
undifferentiated B lymphocytes. The common 
feature in both species is that an oncogene on one 
chromosome is brought into the proximity of an Ig 
locus on another chromosome. Similar events 
occur in T lymphocytes lo bring oncogenes into the 
proximity of a TcR locus. 

The basic cause of the translocation event is a 
malfunction of the system responsible for recom- 
piling V and C genes or switching between TgH 
C genes. Instead of acting on two sites within the Ig 
or TcR locus, the system recombines the immune 
locus with an unrelated region on a different 
chromosome. This results in a reciprocal 
translocation, which is illustrated in Figure 59.9. 

We do not know the basis for the involvement of 
the nonimmune partner, but in both man and 
mouse it is often the c-myc locus. In man, the 
translocations in B cell tumors usually involve chro- 
mosome 8, which carries c-myc, and chromosome 
14, which carries the IgH locus; -10% involve chro- 
mosome 8 and either chromosome 2 (k locus) 
or chromosome 22 (lambda locus). The trans- 
locations in T cell tumors often involve 
chromosome 8, and either chromosome 14 (which 
has the TcR« locus at the other end from the Ig locus) 
or chromosome 7 (which carries TcR /3 locus). 
Analogous translocations occur in the mouse. 

Translocations at the IgH locus in B cells fall into 
two classes. One type is similar to those observed 
at other Ig loci and at TcR loci, involving the 
consensus sequences used for V-D-J somatic 
recombination of active Ig genes. In the other type, 
the translocation occurs at a switching site, so these 
cases may be associated with function of the 
system that 'switches expression from one C|, gene 
to another. 

When c-myc is translocated to the Ig locus, its 
level of expression is usually increased. The 
increase varies considerably among individual 
tumors, generally being in the range from 2 lo 10. 
Why does translocation activate the c-myc gene? 
The Iranslocation event does not involve fixed sites, 
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hut occurs at a variety of locations within a general 
region on each recombining chromosome. The 
event has two consequences: c-myc is brought into 
a new region, one in which an lg or TcR gene was 
actively expressed; and the -structure oflhe c-myc 
gene may itself be changed (but usually not involv- 
ing the coding regions). It seems likely that several 
different mechanisms can activate the c-myc gene 
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in ils new location (just as retroviral insertions 
activate c-myc in a variety of ways). 

The correlation between the tumorigenic 
phenotype and the activation of c-myc by either 
insertion or translocation suggests that continued 
high expression of the c-Myc protein is oncogenic. 
Expression of c-myc must be switched oft to enable 
ininiature lymphocytes to differentiate into mature 
B and T cells; failure to turn off c-myc maintains the 
cells in the undifferentiated (dividing) slate. The 
oncogenic potential of c-myc has been demon- 
strated directly by the creation of transgenic mice 
carrying a normal c-myc gene linked to an 
enhancer. Transgenic mice carrying a c-myc gene 
linked to a B lymph ocyte-speciJIc enhancer (the IgH 
enhancer) develop lymphomas. The tumors repre- 
sent both immature and mature B lymphocytes, 
suggesting that over-expression of c-myc is tumori- 
genic throughout the B cell lineage. Transgenic 
mice carrying a c-myc gene under the control of the 
LTR from a mouse mammary tumor virus, 
however, develop a variety of cancers, including 
mammary carcinomas. This suggests that 
increased or continued expression of c-myc trans- 
forms the type of cell in which it occurs into a 
corresponding tumor. Specificity of the tumor type 
may therefore depend on the mechanism used 
to activate c-myc; it is not an intrinsic property 
of the gene. 

c-myc exhibits three means of oncogene 
activation: retroviral insertion, chromosomal 
translocation, and gene amplification. The com- 
mon thread among them is increased expression of 
the oncogene rather than a qualitative change in its 
coding function, although in at least some cases 
the transcript has lost the usual (and possibly 
regulatory) nontranslated leader, c-myc provides 
die paradigm for oncogenes that may be effectively 
activated by increased (or possibly altered) 
expression. 

Every translocation generates reciprocal prod- 
ucts; sometimes a known oncogene is activated in 
one of the products, but in other cases it is not 
evident which of the reciprocal products has 
responsibility for oncogenicity. Also, it is not 
axiomatic that the gene(s) at the breakpoint have 



responsibility; for example, the translocation could 
provide an enhancer that activates another gene 
nearby. 

A variety of translocations found in B and T cells 
have identified new oncogenes. In some cases, the 
translocation generates a hybrid gene, in which an 
active transcription unit is broken by the transloca- 
tion. This has the result that the exons of one gene 
may be connected to another. In such cases, there 
are two potential causes of oncogenicity: the proto- 
oncogene part of the protein may be activated in 
some way that is independent of the other part, for 
example, because it is over-expressed under ils 
new management (a situation directly comparable 
with the example of c-myc); or the other partner in 
the hybrid gene may have some positive effect that 
generates a gain-of-function in the part of the 
protein coded by the proto-oncogene. 

One of the best characterized cases in which a 
translocation creates a hybrid oncogene is provided 
by the Philadelphia (PW) chromosome present 
in patients with chronic myelogenous leukemia 
(CML). This reciprocal translocation is too small to 
be visible in the karyotype, but links a 5,000 kb 
region from the end of chromosome 9 carrying c-abl 
to the bcr region of chromosome 22. The bcr (break- 
point cluster region) was originally named to 
describe a region of -5.8 kb within which break- 
points occur on chromosome 22. Different cases of 
CML have breakpoints at different locations within 
this region. 

The consequences of this translocation are sum- 
marized in Figure 59.10. The bcr region lies within 
a large (>90 kb) gene, which is now known as the 
bcr gene. The breakpoints in CML usually occur 
within one of two introns in the middle of the gene. 
The same gene is also involved in translocations 
that generate another disease, ALL (acute lym- 
phoblastic leukemia); in this case, the breakpoint in 
the bcr gene occurs in the first nitron. 

The c-abl gene is expressed by alternative 
splicing that uses either of the first two exons. The 
breakpoints in both CML and ALL occur in the 
intron that precedes the first common exon. 
Although the exact breakpoints on both chro- 
mosomes 9 and 22 vary in individual cases, the 
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daltons of the usual -145,000 c-Abl protein, tliat is, 
it has lost just a few N-lerminal amino acids of the 
c-Abl sequence. 

Why is the fusion protein oncogenic? It relies 
on an interaction between the N-terminal region 
provided by bcr with the c-Abl protein. The bcr gene 
has a variety of sequence motifs related to proteins 
involved in signalling pathways, but the pertinent 
one is a serine/threonine kinase activity thai is 
coded within the first exon. This autophosphory- 
lates residues in this part of the protein, and the 
phosphorylation confers the ability to interact with 
a region of the c-Abl protein called the SH2 domain 
(we discuss the nature of SH2 domains later). This 



raises the possibility that the Bcr part of the pro- 
tein interacts with the c-Abl sequences, perhaps 
changing their conformation and activating a 
latent oncogenic potential. 

Changes at the N-lerminus are involved in the 
activation of oncogenic activity of v-abl> a tran$- 
forming version of Hie gene carried in a retrovirus. 
The c-abl gene codes for a tyrosine kinase activity; 
this activity is essential for transforming potential 
in oncogenic variants. Deletion (or replacement) of 
the N-terminal region activates the kinase activity 
and transforming capacity. So the N-terminus 
provides a domain that usually regulates kinase 
activity; its loss may cause inappropriate activation. 



■% 



The common theme in the role of oncogenes in 
lumorigenesis is that increased or altered activity of 
the gene product is oncogenic. Whether the onco- 
w gene is introduced by a virus or results from a 
H mutation in the genome, it is dominant over its 
H allelic proto-oncogene(s). A mutation that activates 
I a single allele is tumorigenic. Tumorigenesis then 
I results from gain of a function. 
I Certain tumors are caused by a different mech- 
1 anism: loss of both alleles at a locus is tumorigenic. 
| Propensity to form such tumors may be inherited 
I through the germline; it also occurs as the result of 
somatic change in the individual. Tumorigenesis 
then results from loss of function. Such cases iden- 
£ tify tumor suppressors: genes whose products are 
needed for normal cell function, and whose loss of 
function causes tumors. The two best characterized 
genes of this class code for the proteins RB and p55. 

Retinoblastoma is a human childhood disease, 
involving a tumor on the retina. It occurs both as a 
| heritable trait and sporadically (by somatic muta- 
tion). It is often associated with deletions of band 
qH of human chromosome 15. The RB gene has 
\ been localized to this region by molecular cloning. 
Figure 59.11 illustrates the situation. 



Retinoblastoma arises when both copies of the RB 
gene are inactivated. In the inherited form of the 
disease, one parental chromosome carries an alter- 
ation in this region, usually a deletion. A somatic 
event in retinal cells that causes loss of the other 
copy of the RB gene causes a tumor. In the sporadic 
form of the disease, the parental chromosomes are 
normal, and both RB alleles are lost by (individual) 
somatic events. 

Almost half the cases of retinoblastoma show 
deletions at the RB locus. In other cases, transcripts 
of the locus are either absent or altered in length. 
The protein product is absent from retinoblastoma 
cells. The cause of the tumor is therefore loss of 
protein function, usually resulting from mutations 
that prevent gene expression (as opposed to point 
mutations that affect function of the protein prod- 
uct). Loss of RB is involved also in other forms of 
cancer, including osteosarcomas and small cell 
lung cancers. 

What is the molecular function of RB? It interacts 
with a variety of other proteins, including several 
tumor antigens: SV40 T antigen, adenovirus E1A, 
human papilloma virus E7. One possibility is that 
part of the oncogenicity of these proteins is due to 
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AMPLIFICATION OF CELLULAR ONCOGENES IN CANCER CELLS 
K. ALITALO 

FROM THE DEPARTMENT OF VIROLOGY, UNIVERSITY OF HELSINKI, HELSINKI, FINLAND 

ABSTRACT 

Regulatory or structural alterations of cellular oncogenes have been implicated in the causation of Ynriou 
cancers. Oncogene alteration by point mutations can result in a protein product with strongly enhanced on* 
cogenic potential. Aberrant expression of cellular oncogenes may be due to tumour-specific chromosomal 
translocations that dysregulate the normal functions of a proto-oncogene. Amplification of cellular otica. 
genes can also augment their expression by increasing the amount of DNA template available for the pro* 
duction of mRNA, It appears that amplification of certain oncogenes is a common correlate of iru< progrcs' 
sion of some tumours and also occurs as a rare sporadic event affecting various oncogenes in different tyn C ^ 
of cancer. Amplified copies of oncogenes may or may not be associated with chromosomal abnormalities 
signifying DNA amplification: double minute chromosomes and homogeneously staining chromosomul 
regions. Amplified oncogenes, whether sporadic or tumour type-specific, are expressed at elevated levels, 
in some cases in cells where their diploid forms are normally silent. Increased dosage of an amplified 
oncogene may contribute to the multistep progression of at least some cancers. 

KEY WORDS: CELLULAR ONCOGENES, GENE AMPLIFICATION. MULTISTEP CARCINOGENESIS, CLONAL SELECTION 
KARYOTYPIC ABNORMALITIES, DOUBLE MINUTE CHROMOSOMES, HOMOGENEOUSLY STAINING CHROMOSOMAL 
REGIONS 



DNA SEQUENCE AMPLIFICATION AND 
. CYTOGENETIC ABNORMALITIES 
IN TUMOURS 

Since its discovery in drug-resistant eukaryotic 
cells, somatic amplification of specific genes 
has been implicated in an increasing variety 
of adaptive responses of cells to environmental 
stresses (70, 79). Cytogenetic abnormalities, 
double minute chromosomes (dmin:s) asso- 
ciated with DNA amplification had already 
been discovered in tumour cells before the 
discovery of dmin:s and homogeneously stain- 
ing chromosomal regions (HSR:s) in cells se- 
lected for drug-resistance (12, 24, 49, 50, 56). 
In metaphase spreads, dmin:s appear as small, 
spherical, usually paired chromosome — like 
structures that lack a centromere (Fig. 2). HSR:s 
stain with intermediate intensity throughout 
their length rather than with the normal pat- 
tern of alternating dark and light bands in 
both trypsin-Giemsa (Fig. 3A) and quinacrine 
dihydrochloride-stained preparations. Both 
kinds of abnormalities are occasionally found 
in metaphases of freshly isolated cancer cells 
but not of normal cells (8). 

Dmin:s and HSR:s are apparently rare in 
tumour cells in vivo, although exact data are 



difficult to obtain since the abnormalities arc 
easily missed in routine cytogenetic analysis 
(8, 42). Dmin:s and HSR:s have been de- 
scribed in most types of in vitro-cultured malign 
nant tumour cells, with a notable frequency 
in neuroblastoma cell lines (11). Initial growth 
in cell culture apparently selects for tumour 
cells that contain either dmin:s or HSR:s. 
Moreover, in culture dmin:s are frequently 
lost, concomitant with the appearance of 
clonal populations of cells that have devel- 
oped an HSR, suggesting that the two cyto- 
genetic abnormalities are alternative forms of 
gene amplification and that HSR:s may confer 
a selective advantage on cells over dmin:s (1 1, 
70). It has been assumed that HSR:s can break 
down to firm dmin:s and that dmin:s can in- 
tegrate into chromosomes to generate HSR* 
(11, 23). Amplified genes may also occupy ab- 
normally banding regions, ABR:s (51, 59). Ex- 
perimental work on drug-resistant cells has 
shown that in the absence of a selection pres- 
sure (drug), dmin.s and the amplified genes 
that they contain are lost, whereas amplified 
DNA in the form of HSR:s is retained in the 
cells (71). This is explained by the fact that 
dmin:s are segregated unevenly in mitosis and 
frequently get lost from the nucleus due to 
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Amplification of cellular oncogenes in cancer cells 

TABLE 1 
Currently known oncogenes. 
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ONCOGENES FOUND IN RETROVIRUSES 






Retrovirus 
(example) 


Oncogene 




Gene product 






Cellular location 


Function of protein 


Class 


ocu 
K5V 

Y73V 

GRFeSV 

Ab'MuLV 

FuSV 

ST-and 

GA-FeSV 

UR2V 


src 
yes 
fgr 
abl 
fps/fes 

fes/fps 

ros 


Plasma membrane 
Plasma membrane 
Plasma membrane 
Plasma membrane 
Cytoplasm 
(plasma membrane?} 
Cytoplasm 
(cytoskeleton?) 


Tyrosine-specific 
protein kinases 
[fgr contains sequences 
homologous to actin) 


Class la {Cytoplasmic 
tyrosine protein kinases) 


AEV 

SMFeSV 

MH-2V 

39H-MSV 

Mo-MSV 


crb-B 

fms 

mil/raf 
raf/mil 
mos 


Plasma membrane 
and cytoplasmic 
membranes 
Plasma membrane 
and cytoplasmic 
membranes 
Cytoplasm 
Cytoplasm 
Cytoplasm 


EGF receptor's cyto- 
plasmic domain 

Cytoplasmic domain of 
a growth factor receptor? 

? 
? 

7 


Class lb (Class la-related 
proteins) 


SSV 


sis 


Secreted 


PDGF-like growth factor 


Class 2 (Growth factors) 


Ha-MSV 
Ki-MSV 


Ha-ras 
Ki-ras 


Plasma membrane 
Plasma membrane 


GTP -binding proteins 


Class 3 {Cytoplasmic 
GTP:ases) 


FBJ-MuSV 

OK-10V 

AMV 


fos 

myc 

myb 


Nucleus 
Nucleus 
Nucleus 


? 

Nuclear matrix protein 

7 


Class 4 (Nuclear phospho- 
proteins) 


SKV 770 
REV 
AEV 
E26V 


ski 
rel 

erb-A. 
ets 


Nucleus? 

7 
? 
? 


? 
? 
? 
7 


Unclassified 


ONCOGENES FOUND IN TUMOUR CELLS BUT NOT IN RETROVIRUSES 
Tumour cell 


Neuroblastoma 
Neuroblastoma 
Small-cell lung cancer 
Neuro-/Glioblaslomas 


N-ras 
N-myc 
h-myc 
neu 


Plasma membrane 

? 

? 

Plasma membrane 


GTP— binding 
? 

7 

Growth factor receptor 


Class 3 
Class 4 
Class 4 
Class lb 
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their lack of centromeres, (49). HSR chromo- 
somes carry centromeres and are therefore 
divided equally at mitosis. If dmin:s and 
HSR:s contain amplified genes that encode 
growth-stimulating protein products, it would 
follow that the more stable chromosomal 
form, the HSR, confers a greater selective 
growth advantage for cells. Although dmin:s 
and HSR:s have been described predominant- 
ly in tumour cells selected for resistance to cy- 
totoxic drugs, it is also clear that dminrs and 
HSR;s may be present in cancer cells before 
ai >y form of therapy (8). It was in this setting 
that we and others first chose to explore the 
Possible amplification of cellular oncogenes, 
toy definition, cellular oncogenes are normally 



innocent genetic loci which can be activated 
to transforming genes in various ways). 1 



DMIN:S AND HSR:S CONTAIN AMPLIFIED 
ONCOGENES 

Table 2 summarizes the somatic amplifica- 
tions of cellular oncogenes so far reported in 

1 It is not the purpose of this review to deal with all forms 
oi r DNA damage that have been found to activate cel- 
lular oncogenes. For the purpose of integrating the re- 
view into a coherent picture, however, the reader is 
given a list of known cellular oncogenes in Table 1 and 
the schematic Figure 1 illustrating the various ways in 
which the oncogenic potential of different proto-onco- 
genes can be activated. 
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Fie 1 Activation of cellular oncogenes. The haploid complement of a proto-oncogene is schematically depicted in A 
composed of three exons (black boxes) in a segment of DNA. The different activated forms are schematically outlined 
in B—G. The abbreviation c-okc stands for cellular oncogene, v-onc viral oncogene. DNA sequences wUh associated 
strong promoter/enhancer functions are striated, and an actively transcribed gene is marked ^£ 
transforming retroviruses have the capacity to transduce cellular oncogenes c-o*c| into their genome modi y them 
and reinsert their activated oncogenes |v.onc| into the genome of host animal cells as a part of their proviral forms. 
The activity of the v-onc gene is greatly enhanced due to the associated promoter of the proviral long terminal repeat. 
Both increased dosage of the oncogene and structural mutations within its sequence may C0 ^^^^^S^ 
C. Slow transforming retroviruses without oncogenes replicate and reinsert their proviral copies into ^ ^""9^ 
during a latency period from infection to tumorigenesis. Tumor initiation through hyperplastic growth may begin, 
when the provirus integrates sufficiently close to a proto-oncogene to activate it through promoter <"^w tac- 
tions It should be noted, however, that mutations have also been found in the oncogenes thus activated and that mula- 
^dSXSSZ oncogenes' has been described in the resulting tumors D. In some mou* P^^^f 
retrovirus-like DNA element {directing the synthesis of the so-called intracisternal A-type particles IAPs) has been 
found in association with a transcriptionally activated oncogene c-mos. The IAP insertion also disrupts the 5 par of 
c-mcs (641 E. In humans, as well as in animals, chromosome translocations may place protooncogenes into transenp- 
Lndly active regions of chromatin, where they may be activated. The details of this rnechamsm have not been worked 
out but it is belif ved to occur for cmyc and c«W genes in Burkitt lymphomas and Philadelphia-chromosome positive 
leukemias, respectively {35. 40). F. Increased amounts of oncogene-specific RNA and protein can also result from an 
excess of DNA template for transcription acquired through oncogene amplification. The ^ ™" A ^ 

primarily on this mechanism. G. Mutationally activated oncogenes have been found m nearly one fifth of human mal g- 
nant tumours. Oncogene loci activated by somatic structural mutations are revealed by transection experiments 
where they are introduced into genetic background of nontumourigenic cultured ^'^^^J^J^ 
transforming loci have been cloned and many of them belong to the c-ras oncogene family. It should be pointed [out 
that both structural mutations and either increased expression or activation of a complementing oncogene may be 
required to achieve a fully tumorigenic phenotype |44|. 



tumour cells. Although the sampling of tu- 
mours is at present small, the finding of 
known cellular oncogenes among amplified 
DNA represented by dmin:s and HSR:s of 
cancer cells is provocative. Amplification has 
been found to affect at least five out of twenty 
known cellular oncogenes and the degree of 
gene amplification varies from five to many 
hundred-fold over the single haploid copies 
found in normal cells (see also ref. 18). The 
first amplification reported involved the c-myc 



oncogene (see Table 1| in a promyelocytic leu- 
kaemia cell line HL-60 (20, 25}. The degree of 
c-myc amplification is between 8—32 fold 
both in the HL-60 cell line and in primary leu- 
kaemic ceils from the patient (20, 25). Origi- 
nal clonal lines of HL-60 were later found to 
contain some dmin:s in culture but their num- 
ber was insufficient to establish any clear cor- 
relation with amplified c-myc. Such a correla- 
tion, however, was discovered for c-myc am- 
plification in a neuroendocrine cell line from 
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CHROMO- 
SOMES 



DMIN.S 



I COLO ?'" Ute ? hron >osomes (arrowheads! in 

l| res O i?J 20DM . co l o 'l carcinoma "lis. A. The dmin:s are 
\ K^rL a ,' r tl dots . anlo rg normal chromosomes in 

' CI- '' 10 ' 1 of dm ! n , :s b Y differential centrifugations. B. 
^s.artmg material C. Chromosome fraction, D. Purt 
"ed dmm:s ,Donna George and the author, unpublished 
data and ref. 52). 



celk lo " carcm °™. COLO 320 (5). In these 
cSt ■ a PP rox,ma tely 30-fold amplified 
aZri, P1 ! S Were map P ed either to HSR:s of 
| <J. depending on the particular subline stud- 

Prinia " C f dm,n:s 'Y t er f alread y P r «ent in the 
P>mary tumour cells from this colon carcino- 

S D &V S hkely th ? f ^ had a,so been 
3i i duri n? growth of the tumour in vivo. 

Cr 3 , y ' ampllfied copies of the c-Ki-ros on- 

«mSf,. i e mapped t0 dmin:s and HSR; s of 
nouse adrenocortical tumor Yl [741. An ex- 

C t K 'f h u S Smce revea,ed amplifica- 
fl s that do not show up as dmims or HSR:s. 



mm 



B 



HSR 



° f £ l0ng Md Sh0rt a ™ ™e HSR- 
ffi^rf ? ^t"!! £ as evolved from « X-chromosome 
(52 and unpublished data of C. C. Lin and the author! ft 

The la,,er ,s shown here by in situ-hybridization 
(5, 52). 



Thu s« fo ;. example, the oncogene is 

amplified tn a characteristic marker chromo- 
ni'Scp 1 °? c 11 carcinoma without evidence 

A- ? rK 6 ' F & 41 and in other tum °«s, 
the amplified c-aW and c-myc oncogene loci 
map o abnormally banding regions [ABR:sl in 
translocated or resident chromosomal seg- 
ments, respectively (59, 76). 

TRANSLOCATIONS AND 
ARRANGEMENTS MAY ACCOMPANY 
ONCOGENE AMPLIFICATION ™ r 

The evolution and progression of the karyo- 

SSFo* m0Ur cells is implicated (see ref. 
68). Concomitant with amplification, DNA se- 
quences acquire an increased mobility in the 
genome with extrachromosomal intermediates 
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TABLE 2 

Sporadic and tumour-specific amplification of cellular oncogenes. • 



Tumour cells 



Onco- Fold Chromosomal Expression Remarks 
gene amplified location of elevated 
amplified gene 



References 



Sporadic: 

HL60 (acute promyelo- c-myc 20 x 
cytic leukaemia, M3} 

COLO320 (colon c-myc 30 x 
carcinoma) 

Yl (adrenocortical c-Ki-ras 50 x 
tumourj 

COLO201/205 (colon cmyb lOx 
carcinoma} 



K562 (chronic myelo- 
genous leukaemia, CMLJ 



c-abl 10 x 



8q(ABR) 
dmin, HSR 
dmin, HSR 
marl 

mar(ABR) 



A431 (epidermoid 
carcinomaj 



c-erbB 15— 20 x n.d. 



ML1-3 (acute myeloid c-myb 5-10x n.d. 
leukaemia, M2J 

SK BR-3 (breast c-myc 10 x n.d. 
carcinoma) 

SEWA (polyoma virus- c-myc 30 x n.d. 
induced mouse tumour! 



Lu-65 (lung giant cell 
carcinoma) 

Primary leukemic cells 
from an acute myeloid 
leukemia (M2J patient 



c-myc 8x 
c-Ki-ras 10 x 
c-myc 33 x 



n.d. 
n.d. 
n.d. 



Tumour-specific: 

small-cell lung cancer c-myc up to 80 x n.d. 



Neuroblastomas 
Glioblastomas 



L-myc, 
N-myc 

N-myc up to 250 x dmin, HSR 
c-erbB - — 



Yes 
Yes 
Yes 
Yes 

Yes 
Yes 

Yes 

Yes 
Yes 



n : d. 
n.d. 
n.d. 



Yes 



Yes 



on culture conditions; 
c-myc amplification 
correlates with growth 
as a tumour 
At least some copies of 
c-Ki-ras mutated 



Most amplifications in the 53, 
variant phenotype of SCLC 



69 



X-myc also amplified in U, 48. 72, 
primary tumours of 73,. 75 

advanced grade 

Rearrangements also found Josef 
personal communication Schlessin- 
ger. per- 
sonal com- 
munication 



n.d. = not determined, mar - marker chromosome, M2, M3 refer to the French-American-Brittish classification of 
acute myeloid leukemias. 

♦ At least one case of oncogene amplification in normal germ-line cells has been found (18). 



Amplification present in 20, 25. 59 | 
primary leukaemic cells 
Part of the amplified 4, 5, 52 j 

c-myc sequences rearranged J 
Levels of p 21< 74 j 
protein elevated 

Patient treated with 4, 6,88 

5-fluorouracil prior to 
culruring of the tumour cells 
C K coamplified in the 21, 22, 41, 

marker that may be derived 54, 76 
from chromosome 22, c-abl 
protein-associated tyrosine 
kinase activated 

Amplification linked to 82 
chromosome 7 translocation 
and sequence rearrangements 
Amount of protein product, 
the EGF receptor, elevated (see 36J 
Abnormalities of chromo- 34, 61, 91 
some 6q22-24, where 
c-myb is normally located 

43 

Cells have dmin:s depending Manfred 



cei: 

larj 
ant 
tbe 
chr 
cog 



Schwab, 
personal 
commu- 
nication 
80 

Unpublished 
data of the 
author and 
A. de la 
Chapelle 



visualized as dmin:s, transpositions and trans- 
locations to other chromosomal segments, etc. 
(see 70 for references). There may not be pre- 
ferred chromosomal sites for the apparent 
reintegration of dmims as HSR:s (75|. In at 



least one case, however, an oncogene may 
have been caught amplifying in situ in its resi- 
dent chromosomal site (59). The finding of 
moderately amplified oncogenes also in chro- 
mosomal sites lacking HSR;s suggests that 
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marl 




A B 



Fig. 4. Localization of amplified c-myb in COLO 201/205 
cells by in situ hybridization. Shown is a characteristic, 
large marker chromosome (marl) with G-banding |A) 
and associated c-myb autoradiographic grains |B). Note 
the absence of HSR:s. Marl has probably evolved from 
chromosome number 6, the resident site of the c-myb on- 
cogene in normal cells (34, 88, 91). (Robert Winqvist and 
the author, unpublished data). 



(onco)gene amplification may be more com- 
mon than the structural alterations shown by 
chromosome banding and microscopy (6, 88). 

In at least three cases reported amplification 
has been accompanied by a DNA rearrange- 
ment of the oncogene (5, 20, 82). In the colon 
carcinoma COLO 320 both damaged and nor- 
mal versions of the c-myc gene are amplified 
|5). Although individual cell clones have not 
yet been examined, our unpublished experi- 
ments suggest that the same dmin-containing 
cells harbor and express both normal and re- 
arranged forms of c-myc. The normal version 
?1 the amplified gene, however, predominates 
m COLO 320 cells containing HSR:s; the rear- 
ranged version is present only in what ap- 
pears to be a single copy (Fig. 5). In the 
c nromc myeloid leukaemia (erythroleukae- 
m 'a) cell line K562 an amplified DNA segment 
insists of portions of both the c-ab] onco- 
gene and the immunoglobulin locus (76). 
n both cases abnormal transcripts are prod- 
ced from the rearranged amplified oncogenes 
6 and ref. 22). In K562 cells, the abnor- 
vaf A° bl 011c °S ene product has also been acti- 
a*ed cjs a tyrosine protein kinase (41). It is not 
"own, however, whether structural alter- 

whu ° f the genes P receded amplification or 
nether they were acquired during the process 
J 8 e ne amplification. It seems likely that a 
, nr ornosomal translocation of c-abl to the Q 
K5fi9 preceded DNA amplification in the 
o2 cells, since ail amplified copies were al- 

Cem ea J r f nged [2l] ' with the chan 8 e reminis- 
11 of the Philadelphia translocation (t(9 f 22)| 
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24HES 


t 24 


DM 


1 8 




| 2.3 




i. 0.8 




2.7kbp 





Sstl 

Fig. 5. Amplification and rearrangement of c-myc iri 
COLO 320 cells. 10 /ig of cellular DNA was digested with 
Sst I, electrophoresed, blotted and probed with a v-myc 
Pst I fragment (ref. 2, left panel). Fragments of 2.7 kbp 
and 1.4 kbp are seen in both normal and amplified c-myc 
DNA. The 3.3 kbp fragment is derived from a DNA seg- 
ment of unknown origin translocated to the 5' region of 
c-myc with a concomitant deletion of its first exon (unpub- 
lished data of Manfred Schwab and the author). HSF, 
human skin fibroblasts; DM, COLO 320 DM cells; HSR, 
COLO 320 HSR cells. Different amounts of DNA from 
COLO 320 DM cells as indicated were mixed with calf 
thymus DNA to give 24 jtg of total DNA, cleaved with 
Sst I ( electrophoresed, blotted and probed with a frag- 
ment of 3 'human c-myc sequences. The intensities of the 
2.7 kbp c-myc fragment in different samples were com* 
pared to assess its copy number, estimated to be about 
30 |5). 



found in most chronic myeloid leukaemia tu- 
mours (35, 66-68). Although they have not 
been sequenced, other reported cases of am- 
plified oncogenes are apparently normal on 
basis of mapping with restriction endonu- 
cleases (see Table 2). Therefore we cannot at 
present view mutation as a necessary com- 
panion of oncogene amplification. 



THE MECHANISMS OF GENE 
AMPLIFICATION 

The mechanisms of gene amplification and 
the structure of the amplified DNA have been 
worked out mainly in experimental settings 
involving selection for drug-resistance in cell 
culture (70). Although the mechanisms are 
still incompletely known and may vary in dif- 
ferent cases, some general features have 
emerged. 

A spontaneous degree of illegitimate DNA 
replication seems to exist in normal cells so 
that various segments of DNA are replicated 
more than once during a single cell cycle (37). 
In unselective conditions this DNA is probab- 
ly lost e.g., through formation of micronucleae 
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c-myc 
RNA 
DM HSR 



t)i»M*0 2 - 3kb 



Fig. 6. Comparison of the electrophoretic mobilities of 
c-myc mRNA:s from COLO 320 DM and HSR cells. The 
size of the normal c-myc mRNA is 2.3 kb. The rearranged 
c-myc locus in DM cells (see Kg. 5) seems to be pre- 
dominantly expressed giving rise to a shortened RNA. 



because the newly synthesised extra copies of 
DNA are not covalently linked to chromo- 
somal DNA of mitotic cells (65, 71}. If, how- 
ever, there is a selection pressure to retain an 
increased gene dosage, progressive multipli- 
cation of gene copy number results. The in- 
cidence of cells bearing amplified genes under 
conditions of cytotoxic selection can vary by 
two orders of magnitude and is greatly in- 
creased by the presence of mitogenic sub- 
stances (hormones or tumour promoters) dur- 
ing selection (10, 84, 85) or certain carcino- 
genic or cytotoxic agents before selection (15, 
55, 79, 80, 81, . 85). An interesting hypothesis 
suggested by Varshavsky (84, 85] supposes that 
the origins of DNA replication "fire" (initiate 
replication) illegitimately several times during 
a single cell cycle and that this kind of "repli- 
con misfiring" may be increased by substances 
such as tumour promoters and mitogenic hor- 
mones (10, 84, 85). Mariani and Schimke (55) 
point out that most of the cytotoxic agents that 
increase the incidence of gene amplification 
are inhibitors of DNA synthesis. Aberrant 
replication is known to take place after tran- 
sient inhibition of DNA synthesis arid this re- 
sponse can lead to gene amplification (46, 47, 
55, 90). Mitogenic hormones probably increase 
disproportionate DNA replication, but they 



also enhance the colony forming efficiency of 
drug-resistant cells in selective conditions 1 10(. 

According to the studies of Axel and his col- 
laborators (65), the multiple cycles of un- 
scheduled DNA replication at a single locus 
during a single ceil cycle result in a structure 
schematically outlined in Fig. IF. The hydro- 
gen-bonded amplified copies of DNA depicted 
in Fig. IF must resolve into a tandem linear 
array before the next mitosis. This may well 
occur by homologous recombination between 
any one of several repeated sequences within 
the amplified domain (45, 65). Part of the re- 
combinations would lead to extrachromosornal 
circles possessing an origin for replication (16, 
62); these could be the precursors of dmin:s. 
The unequal recombinations mean that the 
resolved linear structure consists of tandemly 
repeated but heterogeneous units. According 
to Axel's model a gradient of amplification is 
formed so that centrally located sequences are 
amplified more than sequences distal to the 
origin of replication (65). This also has, in fact, 
been found to explain the large, complex DNA 
domain amplified in neuroblastoma cells in 
vivo (38, see also below). 

The chromosomal site of integration of 
transfected genes significantly affects the fre- 
quency and cytogenetic result of their experi- 
mentally induced amplification (83). The 
amplification frequency in some transfor- 
mants has been found to be 100-fold that of 
the others (83). This suggests that there also 
are preferred chromosomal positions for am- 
plification of host cellular genes and that chro- 
mosomal rearrangements may facilitate gene 
amplification by positioning chromosomal se- 
quences in a favorable array. In respect of the 
structural properties of the sequences involved 
in gene amplification, recombinatorially active 
regions have been implicated in experimental 
cases. DNA rearrangements involving restric- 
tion fragment length polymorphisms and 
variation in gene copy number have been de- 
tected in the human genome between clusters 
of short repetitive interspersed DNA sequences 
called Alu family DNA-sequences (17). Such 
inter-Alu sequences have also been detected 
in an extrachromosornal DNA form, including 
covalently closed circles (17, 78). The copy 
number of inter-Alu sequences apparently va- 
ries in an age- and tissue-specific manner (17, 
78), but any comprehensive analysis of the 
phenomenon in human tumours is not yet 
available. It is also not yet clear whether these 
kinds of repetitive sequences are involved in 
generating amplified oncogene sequences in 
dmhr.s or HSR:s in tumours. 
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Fie 8 A. Indirect immunofluorescence for p21<-««" in Yl 
DM cells. Similar fluorescence of the plasma membrane 
was obtained for the Yl HSR cells. Inset (bl shows control 
staining with normal rat serum and inset Id f« ln «S™ 
normal rat kidney cells with the monoclonal antibody 
against p21. 



NRS anti-p21 



Fig. 7. Elevated levels of the p21-«-» protein in Yl cells 
(74) The Yl DM and HSR cells which harbor a 50-fold 
amplified *Ki-ras oncogene (74) and control cells were 
labeled with [^.methionine and the p 21«*^ protein 
las immunoprecipitated, as detailed }74| with normal 
rat serum (NRS) or rat monoclonal anti-p21 serum. The 
"proteins were electrophoresed in a 15 % polyacrylamide 
gel in the presence of SDS. In addition to a major p21 . 
.band, a labeled band at about 16 kd was present in the 
immunoprecipitates. The amount of radioactivity in p21 
i. was about 50 fold that in normal rat kidney eel s. The 
: Kristen sarcoma virus-transformed rat kidney cells (ob- 
1 tained from the American Tissue Culture Collection) also 
•yielded unexpectedly low amounts of the v-Ki-ros 
protein. 



CARCINOGEN-INDUCED GENE 
AMPLIFICATION AND CLONAL 
.SELECTION OF CANCER CELLS 

Although cell sorting experiments have 
! shown a basal spontaneous rate of gene ampli- 
cation in eukaryotic cells (37), this can be 
^creased severalfold by metabolic inhibitors 
°r cytotoxic agents (15, 37, 70, 81, 85). In 
. many respects the latter response is remims- 
4 ; c ent of the so-called SOS-response elicited in 
Wteriae by noxious stimuli (see 28). In a te- 
kological context, the rapid induction of gene 
Amplification that apparently occurs frequent- 
ly through extrachromosomal intermediates 
^ay provide cells with genetic material for 
^bsequent selective pressures operating in 
Armful conditions (60). In cancer cells, the 
Mechanism may enhance the emergence of 



clonal populations of cells with increasingly 
malignant properties (58, 60). Such genetic 
instability of cancer cells is clearly enhanced, 
leading to the rapid evolution of increasingly 
malignant tumour cell populations (19, 58). A 
serious question of practical importance is 
whether drug resistance in treated patients 
also selects cells that have an enhanced ability 
to amplify (onco)genes important for growth 
and progression of the tumour (84, 85). It is 
also possible that some of the carcinogenic in- 
sults caused by mutagens are only expressed 
as a result of subsequent amplification events 
induced by tumour promoters (84, 85) or facil- 
itated by hormones in, say replicating epithelial 
cells (10) The persistence of dmin:s in some 
tumours suggests that there is a selection 
pressure for their retention (8, 9, 11, ZJ). 
Amplified DNA in dmin:s must contain an 
origin for DNA replication (62) and must be 
selected for in daughter cell populations, 
where it is unevenly segregated (71). In the 
' absence of such a selection pressure dmm:s 
are lost (71). In at least one study the length of 
an HSR has been found to increase during a 
selection of malignant cells for enhanced 
tumour igenicity (30). 

The amplified c-erbB gene in A431 cells codes 
for epidermal growth factor receptor (27). The 
abundant amounts of receptor protein on 
A431 cell surface may, however, provide the 
ceils with an abnormal growth response (31). 
A naive supposition is that the amplified se- 
quences in dmin:s and possibly in HSR:s ot 
tumours contain growth-promoting genes (see 
36 for references). This seems to fit well with 
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Fig. 9. Construction of a v-myc expression vector. A synthetic linker (ATG AGAATTCATCATJ containing a translation! 
initiation codon was inserted downstream from the trp promoter in the pSRC ex 16 Rl expression vector described 
previously (see ref. 3). Approximately one half of the v-src sequences coding for the aminoterminal portion of pp60^ 
protein were then deleted and the remaining portion Iigated in translational codon frame with the synthetic ATG. A 
Hinc II fragment of v-myc from plasmid clone MC 38 (nucleotides 320—685 in the v-myc sequence in ref. 2) was Iigated 
downstream form remaining v-src sequences in continuity with its reading frame. The resulting product contained 3 
amino acids from the synthetic linker, 252 amino acids encoded by the 756 base pair fragment from Sma I to Pvu II 
restriction sites in v-src DNA. 122 amino acids from the v-myc and 6 amino acids (corresponding to nucleotides 

2968-2085) from the pBR322 vector (3). 
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j recent findings on amplified oncogenes, though 
' i n many cases the search for an amplified on- 
cogene is still continuing. Even positive find- 
ings do not mandate a role for amplified cel- 
lular oncogenes, however, because the do- 
main of amplified DNA is inevitably much 
larger than a single genetic locus (e.g. 38). 



ENHANCED EXPRESSION OF AMPLIFIED 
ONCOGENES 

In all cases where they have been studied, the 
amplified oncogenes have been found abund- 
antly expressed at the RNA level, roughly in 
proportion to the amount of DNA amplifica- 
tion (see Table 1). Described cases of elevated 
RNA expression include examples of abnor- 
mal |5, 22) and ectopic (6) transcription. In at 
least four cases this enhancement is not li- 
mited to synthesis of RNA (31, 33, 41, 74, 82). 
The Yl cells that have amplified c-Ki-ras con- 
tain exceptionally large amounts of its protein 
product situated on the plasma membrane 
/ (ref: 74, Fig. 7 and 8). High amounts of the 
| c-myc encoded protein are also found in 
COLO 320 cells that have amplified the gene 
1 (33). The myc oncogenes have recently been 
shown to encode nuclear proteins (ref. 1, 3, 
26, 29, 32, 33, Fig. 9-11). Both the expression 
of the c-myc mRNA (39) and the subcel- 
lular localization of myc proteins are linked to 
the cell cycle (ref. 89, Fig. 12). It may be that 
elevated expression of specific c-myc func- 
tions is necessary for cell cycle progression 
} and the growth transformation aspect of the 
| phenotype of cancer cells that may contribute 
to tumour progression (7, 36). Elevated ex- 
j pression of c-myc has been shown to replace 
» part platelet-derived growth factor in in- 
duction of competence for DNA replication 
I?). Generally, enhanced expression of an on- 
cogene could be a necessary prerequisite for 
acquisition of a growth advantage by cells 
having extra copies of the gene. This effect 



+ + + INDUCTION 

CO # <0.OC9,<DO 

ooVdcM 
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Kg. 10. E. coli 294 was transfected with the hybrid v-src, 
v-wyc plasmid outlined in Fig. 9 and ampicillin-resistant 
bacterial colonies were checked for the production of a 
43,000 m.w. bacterial v-myc protein after induction by 
growth to different optical densities in minimal essential 
medium |M9. induction + ) or complete medium (LB in- 
duction-) (3). 



could also be the principal contribution of 
amplification to tumourigenesis. 

TUMOUR CELL AND STAGE SPECIFICITY 
OF ONCOGENE ACTIVATION AND 
AMPLIFICATION 

Tumour cell specificity of oncogene amplifi- 
cation has been found in three malignancies. 
The c-myc, L-myc or N-myc oncogene is ampli- 
fied in most cases of the variant form of small- 
cell lung cancer cells (53, 69), c-erbE is ampli- 
fied in several glioblastomas (Josef Schlessin- 
ger, personal communication] and the putative 
N-myc oncogene is amplified in about half of 
grade III and IV neuroblastomas (14, 72, 73, 
75). In addition to HSR:s, small-cell lung can- 
cers and neuroblastomas frequently show a 
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Fig. 12. Fluorescent staining for DNA and myc protein in 
myelocytomatosis virus-transformed quail cells. In in- 
terphase cells, the myc protein is confined to the nucleae. 
In the mitotic cell, myc fluorescence is distributed 
throughout the cell unlike fluorescence for chromatin, 
which is compacted to chromosomes in the metaphase 
plate. In fact, there is less myc fluorescence associated 
with chromatin than with the rest of the cell. DAPI, dia- 
minophenyl indole DNA stain. The anti-wye protein rab- 
bit antiserum was used in a 1/200 dilution (ref. 89). 



deletion of a portion of the short arm of chro- 
mosome 1 (13) and chromosome 3 (86, 87), 
respectively, in karyological examination. 
Two kinds of changes have also been described 
in different neuroblastoma oncogenes. The 
first is a mutation in the N-ras gene, an ac- 
tivated oncogene that was discovered because 
of its relation to other ras genes and trans- 
forming activity in transfection experiments 
(77). The second is amplification of a distant 
homoiogue of the c-myc gene called N-myc (72, 
73, 75). Although the transforming potential 
of the ti-myc gene has not yet been established, 
its consistent presence in a core segment of 
amplified neuroblastoma DNA (38, 57, 72, 73, 
75) and its elevated expression in most reti- 
noblastomas (48) suggests its oncogenic nature. 

Taya et al. (80) have recently described a 
human lung giant cell carcinoma grown . in 
nude mice, where both c-Ki-ras and c-myc on- 



cogenes were amplified about 10-fold. Besides, 
sequencing studies indicated that at least 
some of the amplified c-Ki-ras copies were 
also mutationally activated in the 12th codon. 
These results fit to the multistage theory of 
cancer development and progression (see 58). 
Apparently co-operating lesions in cellular on- 
cogenes accumulate during tumour growth 
and selection and increase the malignant po- 
tential of the tumour cells (44). 

When does oncogene amplification come in- 
to play during tumourigenesis? Gene amplifi. 
cation may not be any initiating event in carci- 
nogenesis. Amplification and enhanced ex- 
pression of c-myc and N-myc may occur during 
the progression of human carcinoma of the 
lung and neuroblastoma cells to a more malig- 
nant phenotype (14, 53, 73). There may be, 
however, no mandatory sequence of onco- 
gene amplifications for the genesis of any par- 
ticular tumor. Amplification of an oncogene 
could play its part in malignant progression of 
already initiated cells whenever it happened 
to occur. 
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Abstract 

The human epidermal growth factor (EGF) receptor 1$ known 
to be homologous to the y-trb B oncogene protein of the avian 
erythroblastosis virus. Overexpression of the EGF receptor 
gene in A431 epidermoid carcinoma cells is due to gene 
amplification. In this study, a variety of squamous cell carci- 
nomas were examined and one, SCC-15, contained high levels 
of the EGF receptor as determined by immnnopretipitation via 
an EGF receptor-specific polyclonal antibody. Using a cloned 
EGF receptor complementary DNA as a probe, the level of 
EGF receptor RNA was found to be elevated four-fold in SCC- 
15 relative to normal cultured kerati oocytes. When the same 
probe was used to identify EGF receptor gene fragments on a 
genomic DNA blot, the SCC-15 cell line was shown to possess 
an EGF receptor gene copy number amplified four to five 
times. Gene amplification results in the enhancement in the 
level of the EGF receptor in several carcinomas and could be 
responsible for the appearance of the transformed phenotype 
in these cells. 

Introduction 

The epidermal growth factor (EGF) 1 stimulates growth and 
elicits a wide variety of rapid and delayed responses by binding 
to high-affinity cell-surface receptors which are 170-kD glyco- 
proteins (1). Recently, EGF receptor peptides have been se- 
quenced and found to be homologous to the avian erythro- 
blastosis virus erb B oncogene product (2), suggesting that the 
EGF receptor gene is the human oerb B oncogene. A431 
epidermoid carcinoma cells possess a very large number of 
EGF receptors (3), and the EGF receptor gene is amplified 
~ 30-fold (4-6). This amplification is responsible for the 
overexpression of the EGF receptor protein in these cells 
(4-6). 

A cell culture system has been developed permitting serial 
cultivation of keratinocytcs, whose growth is modulated by 
EGF (7). Such methods have been used to establish cell lines 
from squamous cell carcinomas of the oral epithelium (8). 
Because of the role of EGF in keratinocyte development, we 
quantified EGF receptor protein and RNA in several squamous 
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cell carcinomas. One cell line, SCC-15, was found to contain 
high amounts of receptor protein and RNA, and a four- to 
fivefold amplification of the gene. 

Methods 

The squamous cell carcinomas established by Rheinwald and Beckett 
(8) were obtained from, and maintained according to the American 
Type Culture Collection (RockvjJIc, MD). 1623 was originally designated 
as SCC-15; 1628 as SCC-25; and 1629 as SCC-9 (8). Normal human 
esophageal epithelial cells were grown as reported (9). Maintenance of 
other cell lines was as described elsewhere (10). Proteins were labeled 
with ( 35 S]methionine and immunoprecipitated as previously described 
(10). PolyA+ RNA was isolated by guantdine isothiocyanate solubili- 
zation and CsO centrifugation (1 IX and oligo(oT>affimty chromatog- 
raphy. RNA (Northern) blotting was performed as described (II, 12). 
High molecular weight DNA was isolated (4) and analyzed by DNA 
(Southern) blotting (4, 10, 13). The EGF receptor complementary 
DNA (cDNA) clone pE7 was constructed and isolated from an A431 
cDNA horary (II). DNA fragments were "P-labeled by nick translation. 

Results 

A large number of cell lines were initially screened for EGF 
receptor levels by determining their ability to be killed by an 
EGF-pseudomonas exotoxin conjugate, a technique described 
previously (10). Several squamous cell carcinomas were found 
to be relatively sensitive to the EGF-toxin conjugate, including 
SCC-25, SCC-9, and particularly SCC-15, all derived from the 
human tongue (8). These three cell lines were labeled with 
[^methionine, and their extracts immunoprecipitated with 
a goat polyclonal antibody to the EGF receptor, affinity- 
purified as described (10). When compared with A431 cells, 
which make very large amounts of the EGF receptor, SCC-25 
and SCC-9 make moderate amounts and SCC-15 high amounts 
of the receptor (Fig. l, lane a vs. e. g. and c). Quantitation of 
the immunoprecipitation data revealed that SCC-15, SCC-25, 
and SCC-9 make 41, 15, and 4% of the amount of EGF 
receptor made by A431 cells, respectively. 

Because SCC-15 cells had high levels of receptor, polyA+ 
RNA was isolated from these cells, electrophoretically fraction- 
ated on agarose, and analyzed by RNA (Northern) blotting- A 
cloned A43I cDNA (pE7) encoding the EGF receptor (11) 
was 32 P-labeled and used as a hybridization probe to visualize 
EGF receptor RNAs. Fig. 2 A shows that SCC-15 contains 
both the 10- and 5.6-kilobase species of EGF receptor RNA. 
The levels are approximately four- to fivefold higher than 
those found in either KB or A49S kidney carcinoma cells; 
these cell lines were previously found to possess readily de- 
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tectable levels of both receptor RNAs(lO). Fig. 2 B shows that 
cultured human epithelial cells contain EGF receptor-specific 
RNA {HEIA, lane e) whose levels are higher than an early 
passage human fibroblast D55l (lane /), equivalent to A498 
(lane g)> but much tower than SCC-I5 (lane b). 

To determine if an elevated gene copy number was asso- 
ciated with enhanced expression of the EGF receptor gene in 
SCC-15 cells, genomic DNA was isolated from normal cultured 
epithelial cells (HEIA) and SCC-15 cells, digested with ff/ndlll, 
electrophoreticaHy fractionated, and subjected to DNA blotting 
analysis. An EGF receptor cDNA (pE7) was used as a hybrid- 
ization probe to identify receptor DNA fragments. Fig. 3 A 
reveals that the SCC-15 genome contains four- to fivefold 
amplified EGF receptor gene sequences relative to normal 
epithelial cells (lane a vs. b). Analysis of 0-actin gene fragments 
on the same filter by hybridization to a chick actin cDNA 
probe indicated that equal amounts of DNA were loaded per 
well (data not shown). Digested SCC-15 DNA had to be 
diluted about fourfold (Fig. 3 B, lane e) to approximate the 
signal intensity of receptor DNA fragments from SCC-25, 
SCC-9, and KB cells (lanes tf-i). The KB cell EGF receptor 
gene is known not to be amplified (10). 

Discussion 

It may be significant that A43 1 carcinoma cells are not unique 
in their possession of amplified EGF receptor genes. We report 
here that the EGF receptor gene in squamous cell carcinoma 
SCC-15 is amplified four- to fivefold relative to normal epithelial 
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Figure 2. RNA blot analy- 
sis of polyA+ RNAs using 
the "P-labclcd EGF recep- 
tor cDNA probe pE7. {A) 
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Figure S. DNA blot analysis of ///mJIII-digested genomic DNAs 
using the pE7 probe (see Fig. 2). (A) and (B) arc autoradiographs 
from two separate gels. Sizes are in kilobase pairs (right). 10 jig of 
DNA was loaded except in (fl), lanes d-f, which represent serial 
dilutions of the 1623 DNA shown in lane c. 



cells. The amplification of the EGF receptor gene may cause 
the initiation or maintenance of the malignant state in some 
human cells. 

Previously, we reported that a variety of transformed cell 
lines synthesize relatively high amounts of both EGF receptor 
protein and messenger RNA (10). It is conceivable that a 
moderate or even a small increase in the level of the EGF 
receptor leads to a change in the cellular phenotype, as has 
been demonstrated for the sre gene product (14). If this 
hypothesis is correct, then even a minor amplification of the 
EGF receptor gene copy number could contribute to the onset 
of tumorigenesis. Hendler and Ozanne (15) have examined 
lung squamous cell carcinomas and found that they contain a 
2.5-5-fold increase in EGF receptor levels. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0mtoftt§, Thomas ThyKjaerU, Frederic M. Waldmanll, Hans Wolf**, 
and Julio E. Cells** 



Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation In 
general. The consequences of these changes at both the 
transcription and translation levels Is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-Invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 



phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell llrie 
8T474 has suggested that them is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas fc), and studies of Individual genes In solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2. cycUn cn, 
emsl, and N-myc (3-5), However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 



two-dimensional gel electrophoresis/the results showed / x* ), ^ cation ( 4 )» m & a tavv level of c-myc copy number in- 
that them is a n*,ru> V*„* t- crease was observed without concomitant c-myo protein 



that there is a gene dosage effectuiat in some cases 
superimposes on other regulatory mechanisms. This ef- 
fect depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding Increase In mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels^ Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations In relatively few cases of well focused 
abundant proteins, fefth few exceptions we found a good 
correlation (p < 0.006) between transcript alterations and 
protein levels. The Implications, as well as limitations, 
of the. approach are discussed. Molecular & Cellular 
Proteomlcs 1:37-45, 2002, 



. Aneuploldy Is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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overexpression (6), 

In human bladder tumors, karyotyping, fluorescent In situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-Invasive pTa transitional cell carcinomas (TCCs), 
this Includes loss of chromosome 9 or parts of it, as well as 
toss of Y In males. In minimally Invasive pTI TCCs, the fol- 
lowing alterations have been reported: 2q- 11p~, lq+, . 
1lq13+, 17q+, antf 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes end onco- 
genes; however, the large cJiwmosomaJ areas Involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this Investigation we have combined gehome-wfde tech- 
nology for detecting genomic gains and losses (CGH) wfth 
gene expression profiling techniques (mlcroanays and pro- 
teomlcs) to determine the effect of gene copy number on 
transcript and protein levels In pairs of non-Invasive and In- 
vasive human bladder TCCs, 

EXPERIMENTAL PROCEDURES 
Afaterfa/- Bladder tumor biopsies were sampled after Informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By tight microscopy tumors 335 and 632 were 
staged by an experienced pathologist as pta (superficial papillary). 



1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC, transitional celt carcinoma; LOH, loss of heterozygosity- 
PA-fABP. psortaste-associated fatty acid-binding pmteln; 2d! 
two-dimensional. 



$2002 by The American Society tor Biochemistry end Molecular Biology. 
Thte paper fa available on fine at m^wwvwnoponrine.ore 



Molecular & Cellular Proteomlcs iA 37 



Gene Copy Numbers, Transcripts, and Protein Leve ls 




' ^^Ln^«?S y J]^ er and mRN A^s3ton level. Shown from fefr to ifeht are chromosome (Cftr.), GGH profiles, gene location and 
S * * * )ec ? c 9f«. overall expression level along .the chromosome. A, expression of mRNA in IrrvasU Um^ 733^s 

counterpart tumor 335. B, expression of mRNA In Invasive tumor 827 compared Me^nZZ 

Cfeft). The tatf curve In the ratio profile represents a mean of four chromosomes and Is surrounded by tNn curves 

^t^J^^ ^ x ^ s ^ wr^ the nor^asrve tumor 335 used fcrc^ D NA content Mto 

profite of that chromosome is shown to the right of the Invasive tumor profile. The co/drecf bars represents one gene each Sfedb^ 

^^^Jftf^!^ te ^P^ 5 ^ tevelof the gene m the Invasive tumor c^mparod with the rK>n^rr^eS^^W 

h expression akmg the chromosome; the cofors Indicate that at least half of the genes were ujH^ufeted (Wecfc), at least StiSmm^ 
o^^egutoed more than half of the genes are unchanged (orange). If a gene vJ^^^^S^^^ 

determination of -1800 genes. Centromeres and heterochromatlc regions wero excluded from data analysis. 

grade I and U, respectively, tumors 733 and 827 were staged as pT1 
Gnvaslve Into submucosal 733 was staged as solid, and 827 was 
staged as papillary, both grade HI. 

mBNA Preparatfon-Tlssue biopsies, obtained fresh from surgery, 
were embedded Immediately bi a sodiurn-guanldlnlum mlocyanate 
solution and stored at -80 °C. Total RNA was Isolated using the 
RNAzol B RNA isolation method (WAK-Chemfe Medical GMBH). 
poWA)* RNA was Isolated by an oHgo(<fT) selection step (pllaotex 
mRNA kit; CHagen). 

cRNA Preparation—* of mRNA was used as starting material. 
The first and second strand cDNA synthesis was performed using the 
SuperScnpt® choice system (Invftrogen) according to the manufac- 
turer's Instructions but using an oGgo(dT) primer containing a T7 RNA 
.polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrlp® in vitro transcription Wt (Ambton). BiotMabeled CTP and 



OTP (Enzo) was used, together with unlabeled NTPs In the reaction. 
Following me In vftro transcription reaction, the urUncorpomted nu- 
cleotides were removed using RNeasy columns (CMagen). 

Army Hybridization and Scanning— Army hybridization and scan* 
ntng was modified from a previous method (13). 10 fig of cRNA was 
fragmented at 94 °C for 35 mln In buffer containing 40 nut Trls 
acetate, pH 8.1, 100 rrw KQAc, 30 nw MgOAo. Prior to hybridization, 
me fragmenled cRNA In a 6x SSPE-T hybridization buffer (1 m NaCI, 
10 m«Trfs. pH 7.6, 0.005% Triton), was heated to 95 °C for 5 mln, 
subsequently cooled to 40 'C. and loaded onto the Affymetrix probe 
array cartridge. The probe array was then Incubated for 16 h at 40 °C 
at constant rotation (60 rpm). The . probe array was exposed to 10 
washes In 6X SSPSTat 25 *C followed by 4 washes fri OSX SSPE-T 
at 50 *C. The bfotlnytated cRNA was stained- with a streptavlrfin- 
phycoerythrin conjugate, 10 /*g/ml (Molecular Probes) In 6x SSPE-T 
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Fte. 1—continued 



for 30 win at 25 °C followed by 10 washes In 6x SSPE-T at 25 *C.The 
probe arrays were scanned 1 at 560 nm using a confocal laser scanning 
microscope (made for Affymetrtx by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Asymetrix gene 



Mtavsafefite Ana/^-Microsatetlrte Analysts was performed as 
described previously (14). Microsatetiftes were selected by use of 
wwwJX3bLnUn.nth.gov/geaemapS8, and primer sequences were ob- 
tained from the genome data base at www.gcfc.org. DMA was extracted 
from tumor and Wood and arnpfifledby PCRhavokimeof20plfor35 
cycles. The ampacons were denatured and etectrophoresed tor 3 h In an 
ABI Prism 377. Data were collected tn the Gene Scan program for 
migrnerit analysis, toss of heterozygosity was deflr^ as less tran 33% 
of one afleJe detected In tumor e/npflcons compared with blood. 

Proteomtc Analysts -TCCs were minced into small pieces and 
homogenized In a small glass homogenlzer In 0.5 ml of lysis solution. 
Samples were stored at -20 °C untfl use. The procedure for 2D gel 
electrophoresis has been described fn detail elsewhere (15, 16}. Gels 
were stained with sliver nitrate and/or Coornassle. Brilliant Blue. Pro- 
teins were Identified by a combination of procedures that Included 
rrOcrosequenclng, mass spectrometry, two^tmensfonal gel Western 
Immunobtotilng, and comparison wittvthe master two-dfmenstonal gel 
Image of human keratlnocyte proteins; see Wobase.dk/cgi-b4n/ceIl3. 

CGH-Hybrldlzatlon of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Ruoresceln-labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng). and human Cot-1 DNA (20 fig) were 
denatured at 37 °C for 5 mln and appDed to denalured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing, 
trie slides were oounteretained with 0.15 jag/ml 4,&KJiamidino-2-phe- 
nyTrndcte fn an anti-fade solution. A second hybrxiizatkm was per- 
formed for ail tumor samples using ftoorescein-labeled reference DNA 
and Texas ReoMabeted tumor DNA (inverse labeflng) to confirm the 
aberrattpns detected during the Initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital Image analysis was 
used to Identify chromosomal regions with abnormal fluorescence 
ratios. Indicating; regions of DNA gains and losses. The average 
greercred fluorescence Intensity ratio profiles ware calculated using 
four Images of each chromosome (eight chromosomes total) wKh 
normalization of the green: red fluorescence Intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4 P 6-dlamWmo-2-pherrynndoIe band- 
ing patterns. Only Images showing uniform high intensfty fluores- 
cence with minimal background stabling were analyzed All " 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions .were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization— The CGH analysis 
identified a number, of chromosomal gains and losses in the 
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Table I 

Correlation between a/temtfaris detected by CGH end by expression monftoring 

Top, CGH used as Independent variable (ff CGH alteration - what expression ratio was found}? bottom, altered expression used as 
independent variable frf expression alteration - what CGH deviation was found). 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 
.10 Loss 



10 Up-reguJatJon 

0 town-regulation 

3 No change 

1 Up-regulation 

5 Pown-reguIaUon 

4 No change 



77% 
50% 



10 Gain. 
12 Loss 



8 Up-regulation 
OOown-regulatton 

2 No change 

3 Up-regutatfon 

2 Down regulation 
7 No change 



Expression change clusters 



Tumor 733 vs. 335 
•CGH alterations 



Concordance Expression change dusters 



Tumor 827 vs. 532 
CGH alterations 



80% 
17% 

Concordance 



16 Up-regulation 
21 Dowrwegulatlon 
15 No change 



11 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

12 No change 
3 Gain 

3 Loss 

9 No change 



69% 
38% 
60% 



17 Up-regulation 
9~DownH-egulation 
21 No change 



10GaIn 

5 Loss 

2 No change 

0 Gain 

3 Loss 

6 No change 

1 Gain 
3 Loss 

17 No change 



69% 
33% 
81% 



; two Invasive tumors (stage pTI , TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532} showed only 9p-, 9q22-q33-, and and 7+, 9q-, 
and respectively. Both Invasive tumors showed changes 
(1q22-24+, 2q14.1-qter- f 3q12-q13.3-, 6q12-q22~, 
9q34+, 11q12-q13+, 17+, and 2Qq11.2-q12+) that are typ- 
ical for their disease stage, as well as. additional alterations, 
some of which are shown In Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig, 1A) and 
20q12 in TCC 827 (Fig. 1B). 

mRNA Expression In Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-Invasive counterparts 
(TCCs 532 and 335). This was done In two separata experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal oh the arrays 
were searched In the Unlgene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096} were plotted as bars covering their purported locus, in 
that way It was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level In 
the Invasive versus the non-Invasive counterpart Ears, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the QGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
trie chromosomaJ aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the Independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
Increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed « relative gain of more 
than 100% In DNA copy number that was accompanied by 
increased mRNA expression levels In the two tumor pairs (Rg. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts In both 
TCCs 733 (77%) and 827 (80%) {Table I, fop). ChromosomaJ 
losses, on the other hand, were not accompanied by de- 
creased expression In several cases, and were often. regis- 
tered as having unaltered RNA levels (Table I, top). The Inabil- 
ity to detect RNA expression changes In these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the Independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomaJ material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected In areas with unaltered CGH ratios (Table I, bottom]. 
Furthermore, as a control we looked at areas with no alter- 
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Tumor 827 versus 532 Tumor 733 versus 335 

Fia2« Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration Is shown as a numerical -fold change In ratio between Invasive tumors 827 (A) and 733 (♦) arid their noninvasive 
counterparts 532 and 335. The expression change was taken from the Expression line to the right In Fig. 1, which depicts the resulting 
expression change for a given chromosomal region. At (east half Of the mRNAs from a given region have to be either up- or dowrwegulated 
to be scored as an expression change. All chromosomal arms In which the CGH ratio plus or minus one standard deviation was outskte the 
ratio value of one were Included. 



atlon In expression. No alteration was detected, by CGH In. 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table I, bottom). Because the ability to observe reduced or 
Increased mRNA expression clustering' to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations In the 
regions showing CGH changes against the abflity to detect a 
change In mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2^£gr both tumors TOG 733 (p < 0.0151 and 
TCC 827 <p < 0.0000® a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
.. the DNA copy number) and alterations detected by the array 
based technology (Fig. ^ Similar data were obtained when 
areas with altered expression were used as Independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated tjfr- to 2.0-fold (Table t, bottom) but mostly did 
not at tower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction In expres- 
sion level, which Is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

MlcrosateUite-based Defection of Minor Areas of Loss- 
es— In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1 , TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatell Ites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Rg. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript mcreese/decrease/irv- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Rg.. 1>A). As Indicated above, the mRNA decrease observed In 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the tetomeric end of chro- 
mosome 11p showed a normal ratio In the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two mlcrosatellites (D11S1760, D11S922) 
positioned dose to MUC2, IGF2, and cathepsln D Indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2. 12q21.1, and 16q24 
and In TCC 827 at chromosome 11p15.5, 12p11, 15q1l.2, 
and 18q12 was also examined for chrornosornal fosses using 
mlcrosatellites positioned as close as possible to the gene loci 
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Fio, Mlcrosataflite analysts of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 dose to Hu class I Wstoconipatlbllity antigen (gene 
number 38 .to Rg. 1), (6) by D1SZ735 dose to camepsln E (gene 
number 41 In Rg. 1), and (c) at chromosome 2p23 by D2S2251 close 
to general 0-spectnYi (gene mimber 11 on Rg. 1) and of (d) tumor 627 
showing loss of heterozygosity at chromosome 18q12 by S18S11 18 
dose to mitochondrial 3-cxoacyI-coen2yme A thidase (gene number 
12 In Fig. 1). The upper curves show the electropherogram obtained 
from normal DMA from leukocytes [N), and the tower curves show the 
electropherogram from tumor DNA (7). In ail cases one allele to 
partlaity lost fri the tumor arripJEcon. 

showing reduced rnRNA transcripts. Only the microsatelli'te 
positioned at 18q12 showed LOH {Fig. 3), suggesting that 
transcriptional down-regulation of genes In the other regions 
may be controlled by other mechanisms. 

Relation between Changes In rnRNA and Protein Levels - 
2D-PAGE analysis. In combination with Coomassle Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material, 40 well resolved abundant known 
proteins migrating In areas away from the edges of the pH 



0J 



:l 



C D 



Reduced 
protein 



Unaltered 
protein 



Increased 
protein 



2-D*gel 



Rg. 4. Correlation between protein levels as Judged by 20- 
PAGE and transcript ratio* For comparison proteins were divided In 
three groups, unaltered In level or up- or down-regulated (fiortzontaJ 
axis). The rnRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene (vertfca/ axis), a; mRNAs that were scored as 
present In both tumors used for the ratio calculation; A. mRNAs thai 
were scored as absent In the invasive tumors (along horizontal axis} or 
as absent In non-Invasive reference (fop of figure). Two different 
scaflngs were used to exclude scaling as a confounder, TOCs 827 
and 532 (4-6) were scaled with background suppression, and TCCs 
733 and 335 (ec$ were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences'ln rnRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
fefl), phosphogluoomutase 1 , glutathione transferase class jt number 
4, fatty acid-binding protein homologue, cytokeratin 16, and cyto- 
keralln 13; B (from tefl), fatty acid-binding protein homologue, 28*Da 
heat shock protein, cytokeratin 13, and calcycUn; C<from tefl), o^no 
iase, hnRNP B1, 28-kDa heat shock protein, 14-3^>* and 
pra-mRNA splicing factor; O. mesothe&al keratin K7 (type II); £ (from 
top), glutathione S-transferase-ir and roesotheflai keratin K7 (type. ||); 
F (from top and tefl), adenyiyl cyclase-assoclated protein, E-cadherin, 
keratin 19, calgtzzartn, phosphogty cerate mutase,.annexJn IV, cy« 
tosketetal ysctln. hnRNP A% Integral membrane protein catneJdn 
(IPSO). hnRNP H, brain-type ctathHn Dght chain-a, hnRNP F, 704cDa 
heat shock protein, heterogeneous nuclear ribonucfeoprotein A/B, 
transfattonally comrofled tumor protein, liver gJyceratdehyde-3-phos- 
phafe detiydroflenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase 0-1 subunit; Q, (from top and /air), TCP20, calgizzarln, 70- 
kDa* heat shock protein, calnexin, hnRNP H, cytokeratirt 15, ATP 
synthase, keratin 1 9, trfoseptiosphate Isomerase, hnRNP F,Rverg1yc- 
eraidetyaV3-phesphatase dehydrogenase, glutathione S-transfer- 
ase-ir, and keratin 8; H (from left), plasma geteoHn, autoantigen cal- 
retfcutin, mloredoxto, and NAD+-dependent 1 5 hydroxyprostaglandln 
dehydrogenase; / (from top), prOfyi 4-hydroxylase ft-eubunft, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-bJphos- 
phatase; J armexln it; K t annextn IV; L (from top and tefl). 90-kDa heat, 
shock protein, prolyl hydroxylase 0-eubunit, a-enolase, GRP 78, 
cydopnUIn, and cofifln. 

gradient, and having a known chromosomal location, were 
selected for analysis In the TOO pair 827/532. Proteins were 
Identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between rnRNA and protein alterations (Rg. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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Fta 5. Comparison of protein ami transcript levels in Invasive 
and non-Invasive TCCs. The upper part of the figure shows a 2D gel 
(feff) and the oligonucleotide array irtght} of TCC 632. The reef rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 20 gels of TCCs 532 and 827 are shown below. 
CEeariy, cytokeratlns 13 and 15 are strongly down-regulated In TCC 
827 (red annotation). The tile on the array containing probes for 
cytofceratin.15 Is enlarged below the array {red arrow) from TCC 532 
and Is compared with TCC 827. The upper row of squares Jn each ti!o 
corresponds to perfect match probes; the tower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
specific binding). Absence of signal Is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
detected rn TCC 532 (6161 units) whereas a much lower level was 
detected In TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present En TCC 532 (15659 units), and a 
much lower level was present in TOO 827 (623 units). The 2D gels at 
the bottom of the figure Qaf§ show levels of PA-FABP and adipocyte' 
FABP In TCCs 335 and 733 (Invasive), respectively. Both proteins are 
down-regulated In the Invasive tumor. To the tight we show the array 
bles for the PA-FABP transcript A medium transcript level was de- 
tected tn the case of TCC 335 (1277 units) whereas very tow levels 
were detected In TCd 733 (166 units). /S% Isoelectric focusing. 



keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4), For example, PA-FABP was highly ex- 
pressed in the non-lnvaslve TCC 335 but lost In the Invasive 
counterpart (TCC 733; see Fig, 5). The smaller number of 
proteins detected in both 733 and 335 Was because of the 
smaller size of the biopsies that were available. 

11 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes In tine protein level (Table II). . 
These regions Included genes that encode proteins that are 
found to frequently altered in bladder cancer, namely 
cytokeratlns 17 and 20, annexins II and IV, and the fatly 
acid-binding proteins PA-FABP and FBP1, Foot of these pro* 
telns were encoded by genes In chromosome 17q, a fre- 
quently amplified chromosomal area In Invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of non-lnvaslve 
and invasive TCCs using high throughput expression arrays 
and proteomlcs, in combination with CGH. In general, the 
results showed that there b a dear individual regulation of the 
mRNA expression of single genes, which In some cases was 
superimposed by a DNA copy number effect Inmost cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of . detection. !n several cases, how- 
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Proteins whose expression level correlates with both mRNA end gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration" 


Protein alteration • 


Annexln II 


1Q21 


733 


Gain 


Abs to Pres* 


Increase 


Annexln IV 


2p13 


733 


Gain 


3.9-Fotd up 


Increase 


Cytokeratin 17 


17q12-q21 


827 


Gain 


3.8-Fotd Up 


Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


5.8-Fold up 


Increase 


(PA-JFABP 


8q21.2 


827 


toss 


10-Fold down 


Decrease 


FBPt 


9q22 


827 


Gain 


24-Fold up 


Increase 


Plasma gelsofo 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


1Sq12-q13 


827 


Loss 


25-Fold up 


Decrease 


ProWbffln 


17q21 


827/733 


Gain 


3.7-/2.5-Fold up 6 


Increase 


Proryl-4-hydroxyl 


17q25 


827/733 


Gain 


5.7V1,e-Fotdup 


increase 


hnRNPBI 


7p15 


827 


Loss 


23-Fold down 


. Decrease 



• Abs, absent; Pres, present 

* In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 
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ever, an Increase or .decrease In DNA copy number was - 
associated with cfe novo occurrence or complete loss of tran- . 
script, respectively. Some of these transctlpts could not be 
detected in the non-Invasive tumor but were' present at rela- 
tively high levels In areas with DNA amplifications in the Inva- 
sive tumors (ag. In TCC 733 transcript from cellular llgand of 
annexfh.il gene (chromosome 1q21) from absent to 2670 
arbitrary Units; In TCC 827 transcript from small prollne-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material In this area 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie In the loss of controlled 
methyiation in tumor cells (17-19). Thus, ft may be possible, 
that In chromosomes with Increased DNA copy numbers two 
or more alleles could be demethyiated simultaneously leading 
to a higher transcription level, whereas In chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ptoldy regulation of gene expression In yeast, but In this case all 
the genes were present In the same ratio (22), a situation that Is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects,- 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3tumors, In pTa tumors,. these include 9p-, 9q-, 1q+, Y- 
(2, 6). and injsTI tumors, 2q-,11p-, 1 1q-, 1q+> 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and respectively. Pkewlse, the two minimal invasive 
pTI tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q+(both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q1 3+ (both tumors) (7-9). These ob- 
servations Indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general. Importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases It is only possible to get a crude picture of 
chromosomal Instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with Increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatelfltes as close as pos- 
sible to the locus showing reduced gene expression revealed 
Idss of heterozygosity In several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use ofcDNA mlcroarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed. (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneupioldy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an Impact on the expression level in normal cells and is often 
reduced In tumors. However, the relation between Imprinting 
and gain of chrornosomai material is not known. 

We regard ft as a strength of this Investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urotheiium, as the tumors studied were biologically 
very dose, and probably may represent successive steps In 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available ft was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression Is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (1 0, 23). 

.In the few cases analyzed, mRNA, and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to transiational 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranstated mRNA pools, which are associated with few 
translationaliy inactive rtbosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very Important fn the case of polypeptides with a short 
half-fife (eug. signaling proteins), A poor correlation between 
mRNA arid protein levels was found in Irver celts as deter- 
mined by arrays and 2D-PAGE (25), and a moderate correla- 
tion was recently reported by Ideker etaL (26) In yeast 
(Interestingly, our study revealed a much better correlation 
.between gained chromosomal areas and Increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in trariscripC) One possible 
explanation could be that by losing one allele the change In 
mRNA level Is not so dramatic as compared with gain of 
material, which can be rather unlimited arid may lead to a 
severaffotd Increase In gene copy number resulting In a much 
higher Impact on transcript level, the latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-f old level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may In the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysts of the effect of loss of chrornosomai areas on tran- 
script leveis. 
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' In eleven cases we found a significant correlation between 
: DNA copy number, mRNA expression; and protein level Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area In chromosome 17q.. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven protelns'is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated Is the large 
extent of protein modification that occurs after translation, 
requiring Immunoldentlflcatlon and/or mass spectrometry to 
correctly identify the proteins in the gets. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
tha pathway from. DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on mlcroarrays wfth many thousand radiation 
hybrid-mapped genes will increase the resolution and Informa- 
tion derived from these types of experiments (Z). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translatlonal level, a clearer and 5 more developed un- 
derstanding of the tumor genome will be forthcoming. 
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ABSTRACT* 

Genetic changes underlie tumor progression and may lead to cancer- 
spedite expression of critical genes. Over 11 00 . publications have de- ... 
scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy n amber alterations in cancer, but very few of the gene* 
arretted are known. Here, we performed high-resolution CGH analysis on 
cDNA nikroarrays fa breast cancer and directly compared copy number 
and mRNA expression levels of 13,824. genes to quantitate the. impact of e 
genomic changes on- gene expression. We Identified and mapped the 
boundaries of 24 independent amplicoib, ranging in size from 0 Jt to 12 
Mb, Tbronghout the genome, bcth high- and low-level copy number 
changes bad a snbstantial Impact on gene expression* with 44% of the 
highly amplified genes showing overexpression and. -10,5% of the highly 
overexpressed genes being amplified. Statistical analysis wSm random, 
permutation tests Identified 270 genes- whose expression levels across 14 
. samples were systematically attributable to gene, amplification. T^ese 
included most previously described amplified genes in breast cancer and ' 
many novel targets for genomic alterations, including the BOXB7 gene, 
the presence of which in a novel amplicon at t%q2\3 was validated in 
102% of primary breast cancers and associated with* poor patient prog- 
nosis. In conclusion,' CGH on cDNA micro arrays revealed hundreds of. 
novel genes whose overexpression is attributable to gene amplification* 
These genes may provide insights to the clonal- evolution and progression 
jof breast cancer and highlight promising therapeutic targets. 

INTROBUCTION ' 

.Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-^5). Despite this progress in diagnostic classification, meraolecular 
me ch a nism s underlying gene expression patterns in cancer have re> 
iname^elusive, and the utility of gene expression profiling in the 
identification of specific, therapeutic targets remains ihbsted^ '** 
- 'Accumulation of genetic defects is though! to underlie the clonal / 
evolution of cancer. Identification of the genes that mediate, the effects* 
of genetic changes may be important by highHghtiDg transcripts that 
are actively involved in tumor progression. Such transcripts and their' 
encoded proteins would be ideal targets . for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 andEGFR tf, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, oyer 
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• Fig. l.mmact of gene copy rnmiber 

over- and uwlerexpreased^genca (Y axis) according to cc^y.ncmbetWc* wdiY 
Tfareshoid voJuea used for over- and underexrnrsafon were >2J84 (gjobal upper 7% of 
me cDNA roues) and <0.4826 (gjobal lower 7% of the expression ratios). B. percentage 
of amplified and feleted genes according to expression ratios. Threshold values tor 
amplification and deletion were >U and <0,7. 



20. recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 3 (9, 10)., However, these amplicbns are often 
large and poorly defined, and their impact on gene expression remains 
unknown^ 

We hypothesized that genome-wide identification of those gene 
expression changes- that, are attributable to underlying gene copy 
number alterations would highlight transcripts that are. actively in- 
volved in the causation or maintenance of. the malignant phenotype. 
To identify such transcripts, we applied a combination ofcDNA and 
CGH microarrays. to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes, whose mRNA expres- 



- 5 The abbreviations used are: CGH, comparative genornic Hybridization; FISH, fluo- 
rescence in situ hybridization; RT-PCR, reverse transcrlption-FCR. 
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were excluded from the analysis and were treated as missing Values. The 
distributions of fluorescence ratios were used.to define cotpolnts for mcreaseoV 
decreased copy numbcr..Genes with CGH ratio >U3 (representing the upper 
5% of the CGH ratios across all e xperime n ts) were considered to be afnplifiod, 
and genes with ratio <0;73 (reraesenurig the lower $%) were considered to be 
deleted. . 

Statistical Analysis of CGH and cDNA Mkroarray Data. To- evaluate ' 
the influence of copy number alterations on gene expression, we. applied the 
fallowing statistical approach. CGH and cDNA calibrated intensify ratios were 
log-transformed and normalised- using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell tines were 
median centered. .For each gene, the CGH data were represented by a vector 
that was labeled'! for amplification (ratio, > K43) and 0 for no amplification. 
Amplification was correlated wita gene expression using the signal-to-noise 



sion is most significantly associated with ampliricution of the corre- 
sponding genomictemplate., 

MATERIALS AM) METHODS 

Breast. Cancer ChOl X3ne& Fourteen breast cancer cell lines' (BT-20, BT- 
474, HCC1428,'Hs5^8t; MCF7, Mt>A-36i,'MDA«436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC8I2, ZR-75-J, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas; VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
usmg standard protocols. 

Copy Number and Expression Analyses by cDNA Mteroarrays. The 
preparation and printing of the 1 3, 824 cDNA clones oq glass slides were 

performed as described (1 1-13). Of these clones, 244 represented uncharao- rt " A * Juuwluuu wap conciaica wim gene c 
terized expressed sequence tags, aid the remainder corresponded to known^^^ 08 ' Wc ca * c ^. tcd a weight, w r 
genes. CGH experiments on cDNA microarrays were done as described (14* 
15). Briefly, '20 /tg of genomic DNA from breast cancer cell lines and normal 
• human WBCs were digested tor 14.-18 h with Att& wd J&al (Life Technol- 
ogies, Inc, Rockvflle, MD) and purified by phenoj/chlorofbrm extraction. Six 
MB qf digested ceH line DNAs werij labeled with CyJ-dUlP (Amersham 
Phajniacia) and normal DNA wim CyS-dUTP (Amersham Pharmacia) using 
the Bioprirue Labeling kit (Life Technologies, inc.). Hybridization (14, 15) and 
posthybridizauon washes (13) were done as. described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagenc, 
La JoDa,' CA) was used in all experiments. Forty og of reference RNA were 
labeled .with Cy3-dUTP and 3.5 /*g of test. mRNA wim CyS-dTJTP, "aud-the 
labeled cDNAs were 'hybridized on microarrays as described (13, 1"S). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction, 
average intensities at each clone in the' test hybridization were divided by die 
average mtensiry of the corrc^nding clone in the control hybridization,' For 
the copy number analysis, the ratios were normalized on -the basis of the 
distribution of ratios of all* targets on the array and for the expression* analysts 
on the basis of 88 housekeeping genes,, which were spotted four times onto the 
array. Low quality measurements (/.e„ copy, number data with mean reference 
intensity <f00 fluorescent units, and expression data with bom test*. and 
reference intensity <100 fluorescent unite anaVor.with spot size <50 units) 



%"% ... 

where ^si ^ m «d> °x<> denote the means and SDs for tile expression 
levels tor amplified and nonamplified cell lines, respectively. To.assess the 
Statistical significance of each weight; we r*rftrmed 10,000 random' permu- 
tations of the label vector. The probability that a gene' had a larger or equal 
weight by random permutation man the original weight was denoted by a. A 
.low or (<0.05) indicates a strong association between gene expression and 
amplification.' 

Genomic localization, of cPNA Clones and Amplicorr Mapping. Each 
cDNA clone on the microarray was assigned to a Untgene cluster using the 
Unlgene Build 14J. 6 .A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of CaKfomia Santa Cruz's GoldenPath database. 7 The chromosome and 
bp' positions for each cPNA clone were then retrieved by relating these data 
sets, Amplicons were defined as a CGH copy number, ratio >l0 inat least two 
adjacent clones in two or more cefl lines or a CGH ratio >2j0 in at least three 
adjacent clones in a single cell line. The amplicon start and end posih'ons were - 
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CGBtr'- 



Locason 



Start (Mb) 



lp13 
lq21 
lq22 
3pJ4 

?pl2.1-7plU 

7o3I 

7q32 

8*21.3 

8q24^2 
9pl3 

13422-431. 

17411 ' 

J7ql2-42U 

1742l3i^2IJ3 

17<£2-<g33 

174233-424J 

i9ql3 

20qlI32 

20ql3J2 

20ql3.12^13.13 

2Qql31-ql3.32 



132.79 
17192 
179.23 
7M 
55.62 
125.73 
140.01 
86.45 ; 
98.45 
129.88 
l4ll21 
38.« 
77.15 
86.70 
2930 
•39.79 
52.47 
63.81 
69.93 
• 40.63 
.34i9 
44.00 
46.45 
5132 



End (Mb) 



Sbe (Mb) 



13254 
177,15 
17937 
74.66. 
6055 
13096 
140.68 
92.46 
103X5 
U2.15 
152.1$ 
.39.25. 
8138 
87.62 
30L85 
. 42.80 
55.80 
69.70 
7459 
4U40 
35.85 
45,62 
49A3 
59U2 



.02 
33 
03 
2.7 
53 
52 
0.7 
6.0 
4,6 
123 
1.0 
0.6 
43 
U9 
1.6 
. 3.0 
' 33- 
55 
5.1 
03 
.13 
1.6 
.3.0 
. 7.8 



extended to include neigibpring ncraatnplified clones (ratio, <13). The am- 
plicon rfze determination was partially dependent on local clone density. ' 

F1SB. Dual-color interphase PISH to breast Cancer cell Hoes was done as 
described (17). Bacterial artificial chromosome clone RP11-361K8 was la-" 
Wed with SpectrumOrange (Vysfc, Downer? Grove, IL), and SpectrunV 
OrangeJaboled probe for EGFR was. obtained fiom Vysis. SjjectrumGreen- 
Ubcled chromosome 7. and 17 centromere probes (Vysis) were* used as a 
reference. A tissue microarray containing 612 fbnnalin-fbced, parafHn^mbed- • 
ded primary breast cancers (17) was applied in FISH analyses as described 
' ( 18 >- The use of these specimens was approved by the Emics Committee of the 
University of Basel and by die NIR 'Specimens containing a 2-fold or higher 
tecrease in the number of test probe' signals, as compared with corresponding 
centromere, signals, in at least 10% of the tumor cells were considered to be" 
amplified, Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test 

RT-PCR. The BOXB7 expression level was .&*erxrine4 relative to 
GAPDH. Reverse transcription and PCR amplification were performed using. 
Access RT-PCR System (Promega Con%, Madison, WI) with 10 ng of mRNA 
as a template. HOXB7 primers woe 5'-GAGCAOAGGOACTCOOACTT-3' 
end 5' ^GCGTCAdaTAGCOATTGTA(J-3'. 

H > ' . 

RESJJLTS-. • . " - 

Global Effect ot Copy Number, on Gene Expression. 13,824 
arrayed' cDNA* clones were applied for aiialysis of gene expression 
and gene copy number (CGH iricroarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
-en gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2S) were overexpressed (U t belonged to . 
the global tipper 7% of expression ratios), compared with only €% for 
genes with normal copy number levels (Fig/1A). Gonverselyi 10.5% 
of the transcripts with higk-level expression (cDNA ratio, >1Q) 
showed, increased copy number (Fig. IS). Low-level copy number 
increases and decreases were also associated with similar, although' 
less dramatic, outcomes on gene expression (Fig.- 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for .1 1,994 cDNAs (86.8%) were used to plot copy 
• number changes as a function of genomic position (Fig. 2, Supple- , 
meat Fig, A). The average spacing of clones'throughout the genome 
was 267 kbi. This hljgh-resoiution mapping identified 24 Independent 
breast cancer amplicons, spanning from 0.2 to-12 Mb ofDNA (Table 
1). Several amplification sites detected previously by chromosomal 



CGH were validated, with lq21, 17ql2-q21.2> 17q22^q2S, 20ql3.i 
and 20ql3.2 regions being most corrunonly amplified. Furthexnjore! 
the boundaries of these ampHcons were precisely delineated, In ad- 
dition, novel ampljconsvwere identified at 9pl3 (38.65-39.25 MbV 
andl7q2U(514T-55,80Mb), . * 
Direct Identification of Putatlye Amplification Target Genes, 
The cDNA/CGH micfotaay technique enables the direct eonela'. 
tion of copy number find expression data- on a gene-by-gene basis 
throughout the genome. We directly annotated, hi^resbhrtion 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes" in the MCF-7 breast cancer 
cell line at lpl3, 17q22^q23, and 20ql3 wete : bigjiiy overex- 
pressed. A view of chromosome 7 hi the- WDA-468 coll line 
implicates EGF& as, the most highly oyeiexpressed and amplified 
gene at 7pll-pi2 (Fig.. 34). In BT-474, me two known amplicons 
at .17ql2,and 17q22-q23 contained numerous, highly overex- 
pressed genes (Fig. 32?). In addifion, several gente, including the 
homeobox genes HOXB2 and HdXBi, were highly amplified in a 
. previously undescribed independent aniplicon.at. 17g2i.3, UOXB7 
was systeniajdca^^ ZB^psei) 
as wen as overexpresseii {as verified.by RT-PCR, data not shown) 
in BT-474, OACC812, and2R-75-30 cells, furdiermore, this ncVel 





- -o. . ™, — v« ,„ 6W v w^iw4on data on CGH mlcrcarray amfilea. 4 genes in me 
J 1 LtE^ 000 fa thc ceU Use arc' highly expreued {red Hots) aod include 



Ite EGFR oncogene. B. Kvad genca In die 17ql2,. 17(^1 3, and J7q23 tmpllona b the 
«D line are highly oviaeiqaessed («rf> and Include (he H0XB7 
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BT^^ieastcancercdl^.^. « — ^ wim ^ aujus/ 
.The data labeb and color coding are as indicated for Fig. 2C facts show 
chromosonial CCH proxies for the corresponding chromosomes and validation of the 
increased copy cumber by interphase PISH nshjg BGFR {red) and chromosome 7 

n^Z5 D 7^^> ) to ^^S^^ ^ 7 ^c' Probe (mOandchro- 
mospme 17 centromere (green) to BT-474 ceils (fl).- 
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level fopy nombcr increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than mat of 
tigh-level amplifications. Anenploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on hie. impact of aneuploidy on global gene expression pat- 
terns .in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system .(22-24)* 

The CGH rmaoarray analysis identified 24 independent breast, 
cancer ainplicons. We defined the precise boundaries for, many am- ' 
plfcons detected previously by chromosomal CGR (?, 10, 25, 26) and 

. also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons," One of these novel amplicons 
involved the homeobox gene region at 17q21 3 and led to file over- 
expression of the HOXB7 and HOXB2 genes. The homeodoinain 
tnaiscr^tion ; factors known to be key regulators of embryonic 
development andhave been occasionally reported to undergo aberrant 
egression in cancer (27, 28). HOXB7 transection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tnmorigenicity and angiogenesifl in breast cancer (29-32), The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing &OXB7 in . breast cancer and suggest that 
HOXB7; contributes to tumor progression and confers an aggressive 
disease phenorype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well aa an association of amplification With poor prognosis 

: of the patients. ' 

- We carried out a systematic search to identify .genes whose 
expression levels across all 14 cell lines were attributable to 
amplification- status.. Statistical analysis revealed 270 such- genes, 
(representing ~2% of all genes on the array), including not only 
.previously. described amplified genes, such as HER-2, MYC t 
EGFR> ribosornal protein s6 kinase, and AIB3, but also numerous 
novel genes such BsNRAS-relatedgene (lpl3), syndecan-2 (8q22), 
and bone morphogenfc protein (20ql3.1), whose activation by 
amplification inay similarly .promote breast cancer progression. . 
Most of the 270. genet have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of thern to be causally 
involved, it is. intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a'possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological' insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 
. In summary, we.demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels- of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 261 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map .of 24. independent 
arnplicons in breast cancer; and (c) identification of a set of 270 
genes, 'the overexpression of which was statistically attributable to 
gene amplification;- Characterization of a novel amplicon at 
17q21.3 implicated amplification and overexpression . of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the .identificarioi of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as. well as'to prioritize and 
validate putative > targets for therapy development 
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Genomic DNA copy number alterations are key genetic events In 
the development and progression of human cancers. Here we 
report a genome^de microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation In 
a series of primary human breast tumors. We have profiled DMA 
copy number alteration across 6.691 mapped human genes. In 44 
predominantly advanced, primary breast tumors and 10 breast 
caricer cell fines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
re^itjtton Cgene-by-gene) mapping of ampllcon boundaries and 
the quantitative analysis of ampHctm shape provide significant 
improvement m the localization of candidate oncogenes. Parallel 
microarray measurements, of mRNA levels reveal the remarkable 
degree to which variation In gene copy number contributes to 
to gene expresrfon m tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number; Influences gene ex- 
pression across a wide range of DNA copy number alterations 
(cteletton. low-, mid- and higrvlevei amplification), that on average, 
a Mold change- in DNA copy number Is associated with a corre- 
sponding 1 .5-fbtd change In mRNA levels, and that overall at least 
12% of an the variation In gene expression among the breast 
tumors b dl rectiy attributable to underlying variation m gene copy 
number, these findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic ^bddization (CGH) (1), have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors 0-4). While 
some of Aese regions contain known or candidate oncogenes 
[*f , FGFRl (8 P ll), MYC (8q24), CCND1 (llq!3), ERBB2 
ft* 2 )'™* 7 * 1 ™ <20ql3)J and tumor rkppreior genes 
[RBI (13414) and TP53 (HpS)], the relevant gene(s) 4hin 
other regions (e.g., gain of lq, 8q22, and I7q22-24, and loss of 
8p) remain to be identified A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization arid identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (S% using the same DNA microarrays, we bad 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression; From 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

iruraers and cell tines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
fimction of tumor cells within specimens averaged at least S0%>. 
Details of mdmduaj tumors have been published (8, 9Y, and 
are summarized in Table 1, which is published as suWortine 
irrfoimation on the PNAS web site, www.pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
CoUechort. Genomic DNA was isolated cither using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanoi precipitation. 

DNA Labeling and Microarray Hybridizations. Genomic DNA labcl- 
ing and hybridizations were performed essentially as described 
1 »S5?** 7^ modifications. Two micrograms 

of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted according^. Test" DNA 
(trom tumors and cell lines) was fluorescent^ labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (Le„ UniGene dusters), the 
reference" (labeled with Oft) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GcnePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scan alyze 
software (available at bttp://raiuUblgov> Huorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the suprwrtinl 
^formation). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions, for arrayed human cDNAs were assigned by 



Abbreviation: CGH, comparative genomic hybridization. 
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JdenUft^g the starting position of the best and longest match of 
any DNA sequence represented in the corresponding UniCene 
cluster^lO) against the -Golden Path" g^ome assembly 
(ht^://genome.ucsc.edu/; Oct 7, 2000 Freeze). For UnlGenc 
clusters represented by multiple arrayed elements, mean f luo- 
"jwnce ratios (for all elements representing tie same UniGene 
t*^ *Z rep0rted ' For ***** measurements, fluorescence 
ratios are mean^entered" (ie. reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety m the suprjorring information. 

Results 

We performed CGH on 44 predominantly locally advanced 
praiary breast tumors and 10 breast cancer cell lines, ustna 
CDNA microarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 
microarray. hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (f iuorcs- 

ESS £^//° r) ** 6m ^ M ^ding to the Golden 
ram (http://gcnome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 

T*£? ^ am P Kcons )» b «< ^so Provide precise 
genetic landmarks for chromosomal regions of amplification and 
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deletion. Parallel analysis of DNA from cell lines contatoin* 
diaerent numbers of X chromosomes (Fig. 16), as we did before 
(7). demonstrated the sensitivity of our method to detect smale. 
copy loss (45, XO), and 1.5- (47,XXX), 2r (48*XXX)fo7 
25-fold (49.XXXXX) gains (also see Fig, 5, whfehispuhnsied 
as supporting ^formation on the PNAS web site). Fluorescence 
•ratios were linearly proportional to copy number ratios, which 
were slightfy underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident far both the breast cancer cell hues and primary tumors 
(Fig. la), detected in the tumors despite the presence of euoloid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss- were readily iden- 

?^SL^ ?^£3^ on of brcast ««« ceD Imes/tumors 
(90%/69%, 100%/47%, 100%/60ft, and 90%/44% # respective- 
ry), as were losses within Ip, 3p, 8p, and i3o «Q%/24% 

^U, puy,shed cytogwusacstacEcs (re& 2-4; a complete listing 
of gains/Iossesisprovided In Tables 2an<J 3. which are published 
as supporting information on the PNAS web site). The total 
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of Xtfromosomes. for breast cancer ceil lines, and for breast tumors. Breast cancer cell Hries and tumors am separately ordered by hierarchical clustering to 
highlight recurrent copy number changes. The 241 genes present on the mkromys and mapping to chromosome 8 are ordered by position ak*iq the 
chromosome. Ruorescenceratios (test/reference) are depicted by a log, pseudocolor scale (Indicated). Selected genes are Indicated wi* itor<oded text (red. 
increased; green, decreased; Mac*, no change; gray, not well measured) to reflect correspondingly altered mRNA levels (observed in the majority of the subset 
of samples displaying the DNA copy number change). The map positions for genes of Interest that are not represented on the mlcroarray are Indicated in the* 
row above thoie genes represented on the array. (©) Graphical display of DMA copy number profile for f 



<twmor/normaO are plotted onalogj scale for diromosome 8 genes, ordered along the chromosome. 



for breast cancer ceTi tine SKBR3. Fluorescence ratios 



number of genomic alterations (gains and losses) was found to 
be significantly higher in breast tumors that were high grade (P = 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative (P » 0.04), and harboring TP53 mutations (P - 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3: Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed .three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 26). For each of these regions we can define the 



vvv — w — T<« iwuuvnujr amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as weD as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supportfoe 
information on the PNAS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6\0951 mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful In the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident m an examination of the pseudocolor representations 
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ih,^^^ b ! tween JW numbered gene expression acras chromosome 17. DNA copy number alteration ttfrpef) and mRNA levels Ucwe/0 
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of DNA copy cumber and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; Lex, a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is* not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DMA. amplifica- 
tions (fluorescence ratios >4»' and. representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the WAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also .associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Rg. 4a). For both the 



breast cancer ceil lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant feshion (t> values for pair-wise Student's 
. r tests comparing adjacent classes: cell lines, 4 x lO" 4 *, 1 x 10" 49 . 
5 X 10,-* 1 X ltM- tumors, 1 X HT 43 , 1 X lO"* 4 5 X 10^ 
1 x 10~ 4 ), A linear regression of the average Iog(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1,4- and 1 5-fold changes to mRNA . 
level for the breast cancer cell lines and tumors, respectively (Kg. 
Aa f regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy nam-, 
bcr and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Pig. 4c), and reflects a ■ 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the ritfit tail of the distribution 
m Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation raeasurecl in mRNA levels among the 37 
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tumors that could be attributed to underlying variation in DNA 
copy numl?er. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in mRNA levels can oe 
explained directly by variation in copy number of the altered 
genes (Pig. Ad). We can reduce the* effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >'3); using that data, our estimate of the percent 
variation in rnRNA revels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4#. This stiU.undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining anrolicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing rnRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we ate likely underestimating the 
contribution of DNA copy number' changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression In cancer cells. In 
agreement with our findings, Phillips et aL (14) have shown that 
with the acquisition of tumorigenicity in an fanmortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et al (15) recently reported that in metastatic 
colon, tumors only -4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. ¥oi eaatnple, the study of Platzer et at (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within uaamplifled chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on tower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overtoiling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may. represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject t? specific autoregulation or dosage 
compensation. Third, this rinding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully fi2<& of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape {to identify the "driving*' gene(s) 
• within amplicons (16)1, on a large number of samples, In addition 
to functional studies. Fourth, this rinding suggests that analyzing 

1. Kslltonicmi, Kallfotticrai, O. P., Sudor, IX, RutoWb, D„ Gray, J. W„ 
WaWman, F. & Kntel, D. (1992) Science 258, S18-82J. 

2. KaJUoniemi, A., Kallioaiemi, O. P„ Piper, J, Tanner, M H Stokke, T, Chen, JL, 
Stmth,RS,Fwk«l l D.,Gny.J.W.&WaJ^ 

Sd. USA 91, 21 56-2 16a 

3. Tutkoncn, M, fanner, M, Kwhu, KalUontoni, A, Isota, J. & KalEonkmi 
' O. P. (1998) Cena Chromosomes Cancer 21, 177-184. 

4. Foreran, F, MaWa maid, & H, Monnf, 0„ Chen, Y„ Vddmao, R, Jiang, Y, 
Goodco, G. C. Etokr, S. KaUfodietn^ A. A KalUonlcmi, O. P.<2000) Cancer 
Re* «, 4519-4525. 

5. Solinas-Taldo, S H Lampd, Stilge nbauer, Nk±b2crJco, J„ Bcaner, A, Dohncr, 
H, Cramer, f . & Lfcbicr, P. ( 1997) Cases Chromosomes Cancer 20, 399-407. 

6. Pinket, D* Scgraves, EL, Sudar, D„ dark, &, Poole, U Kowbd, Collins, O, 
Keo, W. U Chen, C, Znai, Y.,enrf. (1998) Ate/. Gen*. 20, 207-211. 

7. PoUacM.R. t Peroo, C M, ACzodoh, A A, Ekcn, M- B„ PcrgomeosctiLkov, 
A^ Willfenu, C. F„ Jeffrey S. S, Botstcm, D. ABrown, P. O. (1999) NaU OtncL 

42-46, 

a Pemu, C M, Sorl«i, Eiscn, M, B„ van do Ryn, NL, Jeffrey, S. Rcc\ C A, 
Pollaeli J. R, Res, D. Johnten, H, Akslen, L. A., esal (2000) Nature 
(Lori&on) 406, 747-752. 
. 9. Sornc,T.,i>eTou,aM^Titeh^ 



the genomic distribution of expressed genes, even within existing 
nticroarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Hg, 3 and supporting information). Fifth, this 
finding, implies that a substantia] portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinkaJ 
behavior) among patients 1 tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this rinding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis. (17, 18), beyond the amplification of specific 
oncogenes » and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochiometric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Abstract 

Background: Colorectal cancer is a common cancer all over the world. Aberrations in the cell 
cycle checkpoints have been shown to be of prognostic significance in colorectal cancer. 

Methods: The expression of cydin Dl r cydin A, histone H3 and JG-67 was examined in 60 colorectal 
cancer cases for co-regulation and impact on overall survival using immunohistochemistry, 
southern blot and in situ hybridization techniques. Immunoreactivity was evaluated semi 
quantitatively by determining the staining index of the studied proteins. 

Results: There was a significant correlation between cydin Dl gene amplification and protein 
overexpression (concordance ~ 63.6%) and between Kf-67 and the other studied proteins. The 
staining Index for K*-67, cydin A and 01 was higher in large, poorly differentiated tumors. The 
staining index of cydin Dl was significantly higher in cases with deeply invasive tumors and nodal 
metastasis. Overexpression of cydin A and 0/ and amplification of cydin Dl were associated with 
reduced overall survival. Multivariate analysis shows that cydin Dl and A are two independent 
prognostic factors in colorectal cancer patients. 

Conclusions: Loss of cell cycle checkpoints control is common in colorectal cancer. Cydin A and 
Dl are superior independent indicators of poor prognosis in colorectal cancer patients. Therefore, 
they may help in predicting the clinical outcome of those patients on an individual basis and could 
be considered important therapeutic targets. 



Background 

Colorectal cancer (CRC) is the third most common cancer 
in Western countries (1). In Egypt, CRC has unique char- 



acteristics that differ from that reported in other countries 
of the western society. It was estimated that 35.6% of the 
Egyptian CRC cases are below 40 years of age and patients 
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usually present with advanced stage, high grade tumors 
that carry more mutations [2]. This uniquely high propor- 
tion of early-onset CRC, the early and continuous expo- 
sure to hazardous environmental agents, the different 
mutational spectrum and the prevalent consanguinity in 
Egypt justify further studies [3|. It was proved that most 
cancers result from accumulation of genetic alterations 
involving certain groups of genes, the majority of which 
are cell cycle regulators that either stimulate or inhibit cell 
cycle progression [1]. Cell proliferation allows orderly 
progression through the cell cycle, which is governed by a 
number of proteins including cyclins and cyclin dependent 
kinases (4,5). The cyclins belong to a superfamily of genes 
whose products complex with various cyc/m-dependeni 
kinases (cdks) to regulate transitions through key check- 
points of the cell cycle (6). Abnormalities of several cyclins 
have been reported in different tumor types, implicating, 
in particular, cyclin A, cyclin E and cyclin D [6,7|. 

Cyclin Dl is a Gl cyclin that regulates the transition from 
Gl to S phase since its peak level and maximum activity 
are reached during the Gl phase of the cell cycle. Whereas 



cyclin A is regarded a regulator of the transition to mitosis 
since it reaches its maximum level during the S and G2 
phases [8]. The mechanisms likely to activate the onco- 
genic properties of the cyclins include chromosomal trans- 
locations, gene amplification and aberrant protein 
overexpression [7,9]. 

Several studies have shown that, hisione H3 mRNA expres- 
sion can be used to identify the S phase fraction (SPF) 
through the in situ hybridization (ISH) technique 1 10,1 1 ]. 
The level of hisione H3 mRNA reaches its peak during the 
S phase and then drops rapidly at the G2 phase [ 12|. 

In face of the increasing incidence of CRC and its peculiar 
pattern in the Egyptian population, the present study was 
conducted to assess the role of Ki-67 (pan-cell cycle 
marker), cyclin Dl (Gl phase marker), hisione H3 mRNA 
(S phase marker), cyclin A (S to G2 phase marker) in CRC. 
The expression level of these markers was correlated to the 
clinicopathologic features and the overall survival of 
patients. 



Table I : CHnico pathological features of patients In relation to the staining index (SI) of Ki-67, cyclin Dl, cyctin A, h/stone H3 

SI (mean + SD) 



Variables 


No. of cases 


10-67 


Cyclin Dl 


Cyclin A 


Histone H3 


Sex 












Male 


36 


18.0 ±6.4 


6.7 1 4.3 


12.715.7 


10.7 1 5.3 


Female 


24 


20.1 ±5.8 


8.8 1 8.4 


10.016.0 


10.715.4 


Age (yean) 












250 


41 


11.7 ±6.0* 


5.6 ± 5.2 


10.0 1 5.3 


6.0 1 5.0* 


<S0 


19 


23.8 1 5.6 


7.7 ± 6.8 


13.615.7 


2X0 1 5.2 


Tumor size (cm) 












<5.0 


33 


12.216.3* 


5.3 1 3.8* 


11.516.1* 


10.314.9* 


25.0 


27 


30.1 1 6.2 


22.817.2 


28.6 1 5.6 


24.0 ± 5.6 


Histology 












Normal 


20 


3,5 ± 2.0* 


0.6 ± 0.2* 


2.3 1 t.l* 


2.2 1 0.9 


Carcinoma 


60 


30.3 ± 6.2 


24.9 1 6.3 


27.2 1 5.8 


10.715.3 


Gl 


15 


117 ±6.2 


6.6 ± 4.0 


10.015.4 


11.414.9 


Gil 


21 


ll.8±5.6 


8.9 ± 3.6 


12.316.5 


7.8 1 5.4 


GUI 


24 


30.0 1 4.3 


22.018.1 


27.0 1 4.9 


11.51 5.4 


Lymph node 












Negative 


33 


19.517.0 


5.4 1 5.3* 


11.9163 


12.3 1 5.5 


Positive 


27 


21.3 14.9 


20.6 1 6.9 


12.5 15.0 


14.215.0 


Depth of Invasion 












m, sm 


17 


20.7 ± 6.7 


3.1 13.1* 


11.917.2 


10.4 15.1 


beyond sm 


43 


21.916.2 


12.4 i 6.5 


12.2 1S.6 


10.715.4 


Stage 












1 


6 


20.6 ± 6.7 


5.7 1 6.9 


24,2 ± 6.9 


U.I ±5.3 


11 


27 


20.8 1 6.9 


5.3 1 4 J 


24.6 1 6.0 


10.4 1 5.7 


III 


12 


22.015.4 


7.7 1 6.0 


27.1 15.2 


10.4 1 4.9 


IV 


15 


24.716.1 


11.319.6 


27.5 1 53 


12.3 ±6.2 



* p. value < 0.05 (significant) 
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Methods 
Tissue samples 

Paraffin-embedded tumor tissues were obtained from 60 
CRC patients (47 colon and 13 rectal carcinomas) that 
were diagnosed and treated at the National Cancer Insti- 
tute, Cairo, Egypt during the period from January, 1997 to 
June, 2002. Clinicopathological data of the studied cases 
are illustrated in table 1. None of the patients received any 
chemotherapy or irradiation prior to surgery. Histological 
diagnosis of all cases was done by 2 independent pathol- 
ogists according to the WHO Histological Classification. 
Tumors were staged according to the TNM staging system 
1 13]. The depth of tumor invasion was classified as inva- 
sion of the mucosa including muscularis mucosa (m), 
invasion of the submucosa (sm), or invasion beyond the 
submucosa [8]. Normal colonic tissues were obtained 
from autopsy specimens (n = 20) and were used as a con- 
trol. The actual survival rate of the patients was calculated 
from the date of resection to the date of death. 

Immunohistoch emistry 

Four micron sections of each normal and tumor specimen 
were cut onto positive-charged slides; air dried overnight, 
de-paraffinized in xylene, hydrated through a series of 
graded alcohol and washed in distilled water and 0.01 
PBS (pH 7.4). Slides were then processed for 1HC as 
described by Handa et al. [8]. using the following anti- 
bodies: Ki-67 (MIB-1, Dako), cyclin A (6E6; Novocastra, 
Newcastle-Upon-Tyne, UK) and cyclin Dl (DCS-6, Dako). 
A case of invasive breast cancer was used as a positive con- 
trol for Ki-67 and cyclin A whereas a case of mantle cell 
lymphoma was used as a control for cyclin Dl. Negative 
controls were obtained by replacing the primary antibody 
by non-immunized rabbit or mouse serum. 

Brown nuclear staining was regarded as a positive result 
for all studied markers. The proportion of positively- 
stained cells and the intensity of staining were scored in 
tumor and normal colorectal mucosal sections at medium 
power (*200). The degree of positive tumor staining (per- 
centage of positive tumor cells in the examined section) 
was scored from 1-6 and the staining intensity was scored 
from 0-6 according to the pattern of staining in the exam- 
ined section. Staining index (SI) was calculated by multi- 
plying the cellularity and staining scores as described by 
King et al. [14]. 

In situ hybridization 

All tumor samples and 5 normal controls were assessed 
for histone H3 mRNA by ISH using the commercially avail- 
able 550 base fluorescein-labeled DNA probe (Dako, 
Carpinteria, CA) as described by Nagao et al., 1996. This 
probe hybridizes to the whole mRNA transcript of the 
human hi$toneH3 gene including the5' and 3' untrans- 
lated regions. Scoring of histone H3 mRNA was performed 



as for immunohistochemisuy, however, hybridization 
signals were detected in the cytoplasm. 

Molecular detection of cyclin Dl gene amplification 
High molecular weight DNA was extracted from paraffin- 
embedded tissues of the tumor and normal colorectal 
mucosal samples as previously described [15|. The pro- 
portion of neoplastic and normal cells was determined in 
each tumor sample by examining hematoxylin and eosin- 
stained slides obtained from the edge of the specimen 
used for DNA extraction. Tumor samples were evaluated 
for amplification of cyclin Dl if more than 75% of the 
examined sections were formed of neoplastic cells. 
Accordingly, 50 cases were eligible for the analysis. Ten 
micrograms of the extracted DNA was digested with 
EcoRl. DNA from selected cases was also digested with 
BgJII and Hind HI. Samples were separated on 0.8% agar- 
ose gels and transferred to Hybond-N membranes (Amer- 
sham Int., Amersham, UK). The membranes were 
hybridized with 50% formamide, 5 x SSC, 5 * Denhardt's, 
500 ug/ml denatured salmon sperm DNA, 10% dextran 
sulphate and 10°cpm/ml of 32 P-labeled PRAD-l probe for 
24 h. Membranes were washed with 2 x SSC, 0. 1% SDS at 
room temperature for 30 min followed by 2 x SSC, 0.1% 
SDS at 60°C for 30 min and 0.1 x SSC, 0. 1% SDS at 60°C 
for 1 h. Filters were autoradiographed using an intensify- 
ing screen at -70° C for 24-72 h. After being stripped free 
of the PRAD-l probe, the same blots were hybridized with 
32 P-labeled B-actin probe to normalize against possible 
variations in the loading or transfer of DNA. The autora- 
diograms were analyzed using a densitometer. Intensities 
o( PRAD-l /cyclin Dl were normalized to the fl-actin con- 
trol bands. The degree of amplification was calculated 
from these normalized values. Amplification was consid- 
ered when the signal of the tumor band was >2-fold the 
value of the matched normal mucosa [16]. 

Statistical analysis 

The Mann-Whitney non-parametric test was used to com- 
pare the Sis of pairs of subjects whereas the Kruskal-wallis 
was used for categorial data. Correlation between indices 
was performed using a simple linear regression test. The 
Kaplan-Meier method was used to create survival curves 
which were analyzed by the log-rank test. The impact of 
different variables on survival was determined using the 
Cox proportional hazards model, p. values less than 0.05 
were considered significant. 

Results 

The results of IHC are illustrated in figures 1 and 2. In gen- 
eral, the staining index (Sis) of all studied markers was 
higher in carcinomas than in normal colonic mucosal 
samples (p = 0.0001). Normal colorectal mucosa revealed 
positive imunostaining for Ki-67 in the lower half of the 
crypts only. A heterogeneous staining pattern was 
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Figure 1 

Norma! colonic mucosa showing positive nuclear immunostaining for: (a) cydin D/, (b) ISH of histone H3 mRNA, (c) Kh67 and 
(d)cye/wA 



detected in the neoplastic cells of well and moderately-dif- 
ferentiated adenocarcinomas whereas a diffuse homoge- 
neous staining pattern was detected in poorly- 
differentiated carcinomas. The SI ranged from 10-40.2 
(mean: 24.6 ± 6.5). 

Immunostaining for cydin Dl was predominantly nuclear 
but cytoplasmic staining was detected in some cases. 
However, unless a nuclear staining was also detected, 
cases with cytoplasmic staining were considered negative. 
Normal colorectal mucosal samples were almost negative 
for cydin Dl whereas 4 1 out of the 60 (68.3%) CRC cases 
were positive. Marked heterogeneity was observed in well- 
and moderately-differentiated adenocarcinomas even 
within the same tumor. Poorly-differentiated carcinomas 
revealed a diffuse staining pattern with more darkly- 
stained nuclei. The SI ranged from 0.5-28.6 (mean: 9.3 ± 
4.2). 



Positive nuclear staining for cydin A was detected in 80% 
(48/60) of CRC cases and in all non-neoplastic control 
samples. Positively-stained nuclei were confined to the 
lower half of the crypts in normal colonic mucosa and dif- 
fusely-dispersed in carcinomas. The SI ranged from 3.3- 
30.2 (mean: 15.1 ±6.6). 

Histone H3 mRNA was intensely expressed in the cyto- 
plasm of all examined samples either neoplastic or non- 
neoplastic. The distribution of histone H3 mRNA was 
similar to that of cydin A and Ki-67 however, the propor- 
tion of histone H3 mRNA positive cells was less than that 
of Ki-67. The SI ranged from 1.8-24.2 (mean: 12.4 ± 5.3). 

The PRAD-1 probe detected 3 EcoRI fragments of 4.0, 2.2 
and 2.0 and 1 Bglll fragment of 15 Kb. PRAD-l/cydin Dl 
gene amplification was detected in 22/50 (44%) cases 
analyzed. The degree of amplification was heterogeneous 
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Figure 2 

A case of well differentiated adenocarcinoma with positive immunostaining for (a) cydin D/, (b) hfstone H3 mRNA, (c) Kf-67, 
and (d) cydin A. Another case of moderately differentiated denocarcinoma with positive immunostaining for: (e) cydin Dl, (f) 
histone H3 mRNA, (g) Kf-67, and (h) cydin A. A case of poorly differentiated adenocarcinoma with diffuse staining for: (i) cydin 
D/, 0) ISH of histone H3 mRNA, (k) Kh67 and (I) cydin A. 



with 2-10 fold increase when compared to normal 
mucosal samples (Figure 3). Amplification was confirmed 
by other restriction enzymes. 

Cbrre/otfojis 

There was a significant correlation between cyclin Dl gene 
amplification and protein ove repression. Out of the 22 



cases that showed amplification 14 showed protein over- 
expression (concordance = 63.6%). 

Linear regression analysis of Sis revealed a significant cor- 
relation between Ki-67 and cyclin Dl, cyclin A, histone H3 
as well as between the Sis of cyclin A and histone H3 (p = 
0.008, 0.0001, and 0.0001 respectively) (Figure 4). There 
was a significant relationship between the SI of both Ki-67 
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Figure 3 

A: Southern blot analysis of normal mucosa (N) and their seven corresponding cases of colonic adenocarcinomas (TI-T7), 
cases No. I, 2, 4, and S are poorly differentiated whereas cases No. 3* 6, and 7 are moderately differentiated. Genomic DNA 
was digested with Bg/H, fractionated by electrophoresis in agarose gel, transferred onto membranes and hybridized with PRADI 
and fl-octsn. Tumors number I-6 (Lanes I-6) show different degrees of PRADI fcyd'm D I amplification, tumor number 7 (lane 7) 
was not amplified. B: Southern blot analysis of 3 cases of adenocarcinomas (T) and matched normal colonic mucosa (N). 
Genomic DNA was digested with EcoRI, fractionated by electrophoresis in agarose gel, transferred onto membranes and 
hybridized with PRADI and fiuctin probes for loading control. The identification of the 3 tumors is the same as in Fig. 3 A with 
amplification of PRADIfcydin Dl in tumors number 4, S (Lanes I, 2) but not 7 (Lane 3). 



and cyclin A and the degree of differentiation of tumors as 
well as the size of the tumor (p < 0.001 and p < 0.01 
respectively). In addition, SI of Ki-67 and histone H3 were 
higher in patients <50 years than in those £50 years (p < 
0.05) (table 1). 



In addition table 2 shows a significant relationship 
between high cyclin Dl SI and large, poorly-differentiated 
tumors, carcinomas with positive lymph node metastasis 
and deeply-invasive carcinomas (p < 0.05, p < 0.001, p < 
0.05 and p < 0.05 respectively). Whereas cyclin Dl gene 
amplification was significantly associated with an 
advanced disease stage since amplification was detected in 
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Ki-67 SI D Cyclin A SI 



Figure 4 

Correlation between the staining intensity of (a) Ki-67 vs. cyclin D/, (b) Kh67 vs. histone H3, (c) 10-67 vs. cydin A and (d) cydin A 
vs. histone H3 mRNA expression. 



10/15 (66.7%) of stage IV tumors compared to 12/45 
(26.7%) of stage I-III tumors (p « 0.002). Similarly, DNA 
amplification was detected in 60.5% (26/43) of the carci- 
nomas with extensive local invasion (beyond sm) but 
only in 23.5% (4/17) of the carcinomas with limited inva- 
sion (m, sm) (p - 0.001). A significant correlation was 
also present between cydin Dl gene amplification and the 
presence of lymph node metastasis (p ■ 0.008) as well as 
between the SI of histone H3, the size of the tumor and the 
patient's age (p < 0.05, p < 0.001 respectively). The SI was 
higher in tumors >5 cm in diameter and in patients <50 
years. 

Survival analysis 

The mean follow-up period for all patients was 30 months 
(range: 1-66 months). Eighteen of 60 patients had 
already died by the time the study was completed. We 



defined the cutoff level for overexpression of each cell 
cycle marker at the point that showed the maximum dif- 
ference of survival rate between the 2 groups separated by 
that point. Cox regression analysis revealed that cyclin A 
overexpression (our definition: SI £ 10.5), cyclin Dl over- 
expression (our definition: SI £ 6.1 ), poorly differentiated 
histology, lymph node metastasis, TNM stage, tumor size 
and depth of invasion were all significant prognostic var- 
iables for survival (Table 3). The Kaplan-Meier survival 
curves for the subgroups of patients who are subdivided 
according to each marker's status are shown in Figure 5. 
Patient with tumors that showed Ki-67 overexpression 
(our definition: SI 2 11.5) and histone H3 overexpression 
(our definition: SI £ 8.2) tended to have poor prognosis 
but this did not reach a statistically significant level, how- 
ever the overall survival was significantly lower in patient 
with cyclin A and cydin Dl overexpression. Cox multivari- 
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Table 2l The relation between cydin Dl overexpresslon vs cydln Dl amplification and cllnieo pathological prognostic markers. 
Variables No. of cases Cydln Dl overexpresslon Cydin D t Amplification 



Tumor size (cm) 



<10 


33 


5.3 ± 3.8* 


13/33 


25.0 


27 


2Z8 ± 7.2 p <0.05 


9/27 p <0.236 


Histology 








GJ 


15 


6.6 ± 4.0 


7/15 


Cll 


21 


8.9 ± 3.6 


8/21 


GUI 


24 


2Z0±8.I ? <0.00l 


7/24 p <0.07S 


Lymph node 




Negative 


33 


5.4 ± 5.3* 


6/33(18.2%) 


Positive 


27 


20.6 ± 6.9 p <0.05 


16/27 (59.3%) p <0.008 


Depth of invasion 








m, sm 


17 


3.1 ±3.1* 


4/l7(23.S%) 


beyond sm 


43 


IZ4±6.5p<0-05 


26/43 (60.5%) p<0.00l 


Stage 








early 


45 


5.5 ± 10.1 


12/45 (26.7%) 


late 


15 


11.3 ± 9.6 P = 0./75 


10/15 (66.7%) p<0.002 



Table 3: Uunivariate analysis of the relationship between survival and the tested markers 

PredlctiveVariables Median Survival HR CI P 



Kf-67 



<II.S 


36 








ill J 


32 


1.826 


0.636-5.243 


0,26 


Cydin D / 










<6.l 


35 








2>6.l 


18 


7.246 


1.007-45.150 


0.03* 


Wfstone H3 










<8.2 


35 








£8.2 


29 


4.639 


0.854-25.196 


0.07 


Cydln A 










<I0.5 


35 








210.5 


15 


7.820 


1.017-60.122 


0.02* 


Histological grade 










Low 


38 








High 


10 


7.331 


2.696-19.940 


0.0001* 


Lymph node 










Negative 


38 








Positive 


15 


6.826 


1.973-23.621 


0.002* 


Stage 










1, 11, III 


38 








IV 


12 


6.378 


1.842 -2Z083 


0.001* 


Tumor size (cm) 










<5.0 


35 








*5.0 


13 


4.835 


1.386- 16.868 


0.01* 


Depth of Invasion 










TI.T2 


36 








T3.T4 


20 


7.759 


1.024-58.789 


0.04* 


Age (years) 










<50 


38 








250 


28 


Z802 


0.988 - 7.943 


0.0526 


Sex 










Male 


38 








Female 


36 


0.696 


00.274- 1.766 


0.4449 



* p. value < 0.05 (significant) 

HR: Hazard Ratio 

CI: 95% confidence Interval 
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Figure 5 

Kaplan-Meier survival curves for colorectal carcinoma. Overall survival is significantly lower in patients with (a) cyclin A and (b) 
cydin 01 overexpression. Patients with high SI for histone H3 mRNA have poorer prognosis but this was not statistically signif- 
icant (c). No significant difference was present between patients with high Ki-67 SI and those with low K/-67 SI (d). 



ate regression analysis revealed that lymph node metasta- 
sis, cyclin A and cyclin Dl overexpression were 
independent negative prognostic factors after adjustment 
for the depth of tumor invasion, age and sex of the patient 
(Table 4). 

Discussion 

The proliferative activity of CRC ceils has been investi- 
gated in several studies either by immunohistochemicai 
determination of cell proliferation index using antibodies 
to some types of cyclins or by flowcytometric determina- 
tion of the SPF of the cell cycle [8]. Although Leach et ai. 
[17] did not find cyclin Dl gene amplification in a panel 
of 47 CRC cell lines; its protein was overexpressed in 
about 30% of CRC cases that were included in the studies 



of Bartakova ei al. [6] and Arber ei al. [18]. In the former 
study [6]cycim Dl was aberrantly accumulated in a 
significant subset of human CRC cases and the cell lines 
derived from these cases were dependent on cyclin in their 
cell cycle progression. In the second study [18], overex- 
pression of cyclin Dl was detected in 30% of adenomatous 
polyps indicating that overexpression is a relatively early 
event in colon carcinogenesis which is possibly responsi- 
ble for the pathological changes in the mucosa preceding 
neoplastic transformation. More recently, Holland et al. 
[19], Pasz-Walczak et al. [20] and Utsunomiya et al. (21 1 
reported up-regulation of cyclin Dl in 58.7%, 100% and 
43% of their studied cases respectively. 
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Table 4: Multivariate analysis ofthe relationship between survival 
and thetested markers 



PredictbreVariabies 


HR 


a 


P 


Cyclin Dl 


10.864 


l .055 - 86.250 


0.03* 


(baseline < 6.1) 








Cyclin A 


13.886 


1.012- 190.579 


0.0490* 


(baseline < 10.5) 








Positive Lymph node 


3.921 


1.057- 14.472 


O.04I0* 


metastasis 








Stage IV 


3.411 


1.048- 1X083 


0.03* 


Depth of Invasion 








T3,T4 


5.408 


0.449 - 6S.080 


0.1836 


Age (years) 










1.996 


0.678-5.878 


0.2310 


Sex 


0.910 


0.315-2.358 


0.8453 



p. value < 0.05 (significant) 

HR: Hazard Ratio 

CI: 95% confidence Interval 



In the present study, up-regulation of cyclin Dl was 
detected in 68.3% of the cases. The SI was significantly 
higher in carcinomas than in normal colorectal mucosa 
and in poorly-differentiated adenocarcinomas it was 
approximately twice that of other histological types. 
Amplification and/or overexpression of cyclin Dl signifi- 
cantly correlated with deeply invasive tumors and positive 
lymph node metastasis. Our results in this regards are con- 
sistent with previous studies [8,22]. In 2001, Holland et 
ai. [19]. demonstrated that deregulation of cyclin Dl and 
p2l* a f proteins are important in colorectal tumorigenesis 
and have implications for patient prognosis. Similarly 
McKay et al. [23) found that cyclin Dl was the only protein 
in their panel {cyclin Dl, p53, pIG, Rb-1, PCNA and p27) 
that correlated with improved outcome in CRC patients. 
However, few studies failed to detect any correlation 
between cyclin Dl overexpression and the 
clinicopathological factors in CRC [6,18]. This contro- 
versy in results could partially be explained by the differ- 
ence in the sampling of studied cases. The present study 
included 24 cases of poorly differentiated adenocarci- 
noma, which is not common in other studies of CRC in 
western countries. This was possible because the majority 
of CRC cases diagnosed in Egypt are of high histological 
grade |3|. The correlation between cyclin Dl overexpres- 
sion and the high histological grade was also reported in 
other tumor types including non-small cell lung 
carcinomas [24] and squamous ceil carcinomas of the lar- 
ynx [16]. Another possible explanation for the observed 
controversy in the results of different studies is the detec- 
tion method used. 

In the present work, overexpression of cyclin Dl was more 
common than gene amplification of the PRAD-1 /cyclin Dl 



gene with a 63.6% concordance. This was similarly 
reported by Bartakova et al. [6] who mentioned that there 
is a subset of CRC cases in which cyclin Dl is overex- 
pressed without PRAD-1 /cyclin Dl gene amplification. 
Consistent with this hypothesis are reports of elevated cyc- 
lin Dl mRNA levels and immunohistochemically detecta- 
ble accumulation of the protein in over one third of breast 
cancer cases at a frequency significantly higher than that 
deduced from DNA amplification studies [9,25]. These 
data imply that mechanisms other than gene 
amplification can also lead to deregulation and accumu- 
lation of cyclin Dl in solid tumors. 

So far, several studies were done to reveal the prognostic 
significance of cyclin Dl overexpression in various carci- 
nomas, including CRC [22], However, these studies 
yielded conflicting results which could be attributed to 
organ heterogeneity. In our study, patients with tumors 
that exhibited cyclin Dl overexpression tended to have 
poor prognosis. 

It was reported that, patients with cyclin A positive carci- 
nomas had significantly shorter median survival times. 
Handa et al. [8] were able to delect cyclin A overexpression 
in 77% of their CRC cases. They also demonstrated that, 
cylcin A could be used as a prognostic factor of CRC. More 
recently, Habermann et al. [26] studied cases of ulcerative 
colitis with and without an associated adenocarcinoma 
for the presence of cyclin A overexpression. They found 
that, cyclin A overexpression was higher in cases of ulcera- 
tive colitis with adenocarcinomas than in those without 
adenocarcinomas. Consequently, they concluded that, 
cyclin A could be used for monitoring ulcerative colitis 
patients and for the early detection of an emerging carci- 
noma in this high risk group of patients. 

In our study, cyclin A was detected in 80% of the patients 
and Cox regression analysis showed that it could be used 
as a prognostic marker in CRC in addition to cyclin Dl . 

It would have been useful if we assessed the expression 
level of cyclin A by another technique (DNA 
amplification). This would have added more information 
regarding the gene status on one hand and confirmed the 
results of IHC on the other hand. Unfortunately, this was 
not possible because in most of the cases included in the 
present work, the extracted DNA was not sufficient to 
study cyclin amplification after the assessment of cyclin Dl . 

In 1996, Nagao et al. [11] reported that histone H3 labe- 
ling index significantly correlated with ki-67 immunos- 
taining and was high in poorly differentiated human 
hepatocellular carcinoma. This was similarly reported in 
the present work since we found a significant correlation 
between the SI of histone H3 and Ki-67. However, no 
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statistically significant correlation was found between his- 
tone H3 SI and any of the studied clinicopathological 
factors. 

Although Ki-67 immunostaining reflects the proliferative 
activity of CRC, it has not been recognized as a significant 
prognostic factor in this type of tumors [27,28]. However, 
Suzuki at al. [29] found a significant correlation between 
Ki-67 labeling index and local invasion of CRC. In the 
present study there was a significant relationship between 
the SI of Ki-67 1 tumor size and grade. However, Kaplan- 
Meier survival curves showed no significant difference in 
survival rates between patients with- and without overex- 
pression of Ki-67. 

Conclusions 

Our results demonstrate that cyclin Dl, cyclin A, histone H3 
and Ki-67 are overexpressed in a subset of CRC, however 
only cyclin Dl and cyclin A overexpression correlates with 
poor differentiation and tumor progression. This indi- 
cates the superiority of cyclin A and cyclin Dl as indicators 
of poor prognosis compared to Ki-67 and histone H3 
mRNA in CRC. Cyclin A and Dl could therefore be consid- 
ered significant, independent prognostic factors in CRC 
patients. These findings are especially important in stage 
II patients since 25-30% of those patients have poor prog- 
nosis in spite of being node- negative. However, the stand- 
ard clinicopathologic prognostic factors can not identify 
this subset accurately and therefore; there is a great 
demand for more accurate, individually-based, biological 
prognostic parameters that help in detecting this high risk 
group of patients who can benefit from an adjuvant ther- 
apy. If the findings of the present study are confirmed in a 
larger study, evaluation of cyclin A and Dl may be applica- 
ble to clinical management of CRC, allowing the identifi- 
cation of patients with poor prognosis. 
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Correlation of amplification and overexpression of the c-myc 
oncogene in high-grade breast cancer; FISH, in situ hybridisation 
and immunohistochemical analyses 
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In this study, we analysed gene amplification, RNA expression and protein expression of the c-myc gene on archival tissue specimens 
of high-grade human breast cancer, using fluorescent in situ hybridisation (FISH), nonradioactive in situ hybridisation and 
immunohistochemistry. The specific question that we addressed was whether expression of c-Myc mRNA and protein were 
correlated with its gene copy amplification, as determined by FISH. Although c-Myc is one of the most commonly amplified 
oncogenes in human breast cancer, few studies have utilised in situ approaches to directly analyse the gene copy amplification. RNA 
transcription and protein expression on human breast tumour tissue sections. We now report that by using the sensitive FISH 
technique, a high proportion (70%) of high-grade breast carcinoma were amplified for the c-myc gene, irrespective of status of the 
oestrogen receptor. However, the level of amplification was low, ranging between one and four copies of gene gains, and the majority 
(84%) of the cases with this gene amplification gained only one to two copies. Approximately 92% of the cases were positive forc- 
myc RNA transcription, and essentially all demonstrated c-myc protein expression. In fact a wide range of expression levels were 
detected Statistically significant correlations were identified among the gene amplification indices, the RNA expression scores and 
protein expression scores, c-myc gene amplification, as detected by FISH, was significantly associated with expression of its mRNA. as 
measured by the intensity of in situ hybridisation in invasive cells (P = 0.0067), and by the percentage of invasive cells positive for 
mRNA expression (P = 0.0006). c-myc gene amplification was also correlated with the percentage of tumour cells which expressed 
high levels of its protein, as detected by immunohistochemistry in invasive cells (P = 0.001 6). Thus, although multiple mechanisms are 
known to regulate normal and aberrent expression of c-myc, in this study, where in situ methodologies were used to evaluate high- 
grade human breast cancers, gene amplification of c-myc appears to play a key role in regulating expression of its mRNA and proteia 
British Journal of Cancer (2004) 90, 1612- 1619. doi:IO.I038/sj.bjc.660l703 www.bjcancer.com 
Published online 30 March 2004 
© 2004 Cancer Research UK 
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The c-myc oncogene has been shown to be amplified and/or 
overexpressed in many types of human cancer (Marcu et al, 1992; 
Nass and Dickson, 1997; Nesbit et al, 1999; Uao and Dickson, 
2000). Numerous experiments in vivo have also causally linked 
aberrant expression of this gene to the development and 
progression of cancer in different body sites (Marcu et al, 1992; 
Nass and Dickson, 1997; Nesbit et al, 1999; Liao and Dickson, 
2000). However, several critical issues regarding the significance of 
c-myc in human cancer still remain obscure. First, even for a given 
type of malignancy, the frequencies of the alterations of c-myc at 
the cytogenetic and expression levels vary gready from one report 
to another (Liao and Dickson, 2000). For instance, the frequencies 
of its amplification, mRNA and protein overexpression in breast 
cancer vary between 1-94, 22-95 and roughly 50-100%, 
respectively, among different reports (Liao and Dickson, 2000). 
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Thus, it is still unclear to what extent this gene is altered at the 
cytogenetic level and at different expression levels in breast 
carcinoma. 

One controversial issue pertains to the prognostic value of c-myc 
gene alterations in cancer. The central role of c-Myc protein in 
accelerating cell proliferation, documented by many early studies, 
has led to a general concept for many types of cancer that 
amplification or overexpression of this gene may be associated 
with a more aggressive tumour and a poorer patient survival 
(Berns et al, 1992; Marcu et al, 1992; Sato et al, 1995; Nass and 
Dickson, 1997; Nesbit et at, 1999; Visca et al, 1999; Liao and 
Dickson, 2000). However, many reports have shown an opposite 
correlation (Sikora et al. 1985, 1987; Watson et al, 1986; Polacarz 
et al, 1989; Voravud et at, 1989; Williams et al, 1990; Melhem et at, 
1992; Pietilainen et al, 1995; Dieboid et al, 1996; Smith and Goh, 
1996; Augenlich et al, 1997; Bieche et al, 1999), while other studies 
do not support either of these conclusions. For instance, gene 
amplification or overexpression of c-Myc protein has also been 
shown to associate with a better tumour differentiation or a better 
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patient survival for cancer of the testis, ovary, bile ducts, colon and 
breast {Sikora et al, 1985, 1987; Watson et al, 1986; Polacarz et al, 
1989; Voravud et al, 1989; Williams et al, 1990; Melhem et al, 1992; 
Pietilainen et al, 1995; Diebold et al, 1996; Smith and Goh, 1996; 
Augenlich et al, 1997; Bieche et al, 1999). This controversy does 
not appear to be related completely to the cancer type, since both 
positive (Berns et al, 1992; Visca et al, 1999) and negative 
(Williams et al, 1990; Melhem et al, 1992; Pietilainen et al, 1995; 
Smith and Goh, 1996; Augenlich et al, 1997; Bieche et al, 1999) 
correlations have been reported for colon cancer and breast 
cancer. More interestingly, c-Myc overexpression has been shown 
to predict a poorer prognosis for cutaneous melanoma, but a 
favourable outcome for uveal melanoma (Grover et al, 1997; Chana 
et al, 1998a, b, 1999; Grover et al, 1999). These data indicate 
different roles of c-Myc, even in the same type of tumour, perhaps 
depending upon different tissue microenvironments. 

Another controversial issue concerns the nuclear -cytoplasmic 
localisation of c-Myc. Studies of neoplasms of the colon, testis, 
ovary and liver have shown that predominantly nuclear localisa- 
tion of c-Myc tends to occur in benign lesions, while cytoplasmic 
localisation tends to occur in more malignant tumours (Sikora 
et al, 1985; Sundaresan et al, 1987; Melhem et al, 1992; Sasano et al, 
1992; Yuen et al, 2001). Whether these patterns of subcellular 
localisation of c-Myc tend to reflect the malignant status of breast 
cancer remains an enigma. 

A recent study of the impact of DNA amplification on gene 
expression patterns in breast cancer used mRNA and DNA from 14 
breast cancer cell lines. Analysis was conducted with a 13000 
cDNA clone array for gene expression measurement and a 
Comparative Genomic Hybridisation (CGH) microarray for gene 
copy number measurements. This study also included known 
breast cancer genes, such as c-myc, HER/2-neu and aibl (Hyman 
et al, 2002). Interestingly, 44% of the most highly amplified genes 
were also overexpressed at the mRNA level. Consistent with this 
pattern, c-Myc gene copy number and its expression levels showed 
a statistically significant (or = 0.020) correlation in this microarray 
study of breast cancer cell lines. Another study, by Pollack and 
colleagues, used microarray analysis and BAC array CGH of RNA 
and DNA (respectively) extracted from intermediate grade human 
breast tissues, and tested for amplification and expression of c- 
Myc (among other genes). This study demonstrated that two out of 
37 specimens were both amplified and overexpressed, while others 
were either amplified or overexpressed, but not both. The authors 
of this study suggested that contaminating stromal tissue may 
compress the fluorescence ratios leading to underestimates of gene 
amplification and overexpression (Pollack et al, 2002). 

To more clearly address the importance of gene amplification 
and expression of c-Myc in human breast cancer, we used in situ 
methodologies, which can clearly distinguish stromal and carci- 
noma components. We studied the amplification and over 
expression of the c-myc gene with fluorescent in situ hybridisation 
(FISH), non-radioactive in situ hybridisation (1SH) and immuno- 
histochemical (IHC) approaches on paraffin-embedded biopsy 
sections of untreated, high-grade breast cancer. It was observed 
that 70, 92 and 70% of the cancer cases exhibited c-m^c gene 
amplification, its mRNA overexpression and its protein over 
expression, respectively. In most of the cases (84%) that showed 
gene amplification, the c-myc gene gained only one to two copies, 
which is consistent with c-myc FISH data from other studies. 
Unlike some oncogenes, such as U-myc, which typically demon- 
strates gene amplification copy numbers of greater then 10 in 
neuroblastoma, and HER-2/neu (Sartelet et al, 2002), whose copy 
numbers range up to 14-40 in breast carcinomas (Isola et al, 
1999), gene copy numbers of c-myc are not as greatly increased. In 
the study noted earlier, using breast cancer cell line CGH array and 
cDNA microarray expression analysis, it was demonstrated that 
the most dramatically increased expression levels were associated 
with large gene copy number increases, although low-level gains 



and losses had a significant influence on gene expression 
dysregulation (Hyman et al, 2002). Only one study has been 
published (Pollack et al, 2002) that has begun to determine if these 
findings are directly relevant to actual human breast tumour 
tissues, since many of the genetic changes in tissue culture cell 
lines are more extreme than those displayed in primary tumour 
material. Furthermore, the relationships among gene amplifica- 
tion, mRNA expression and c-Myc protein expression were not 
explored in prior human breast cancer cell line and tumour tissue 
studies (Hyman et al, 2002; Pollack et al, 2002). 

In our human breast tumour tissue study, a high correlation was 
found between c-myc FISH and ISH, for both percentage of 
staining (P< 0.0067) and intensity positive cells (P<0.0006), In 
addition, c-myc gene copy amplification by FISH was correlated 
with c-Myc protein expression positive cells by IHC (P< 0.0016). 
These results support the idea that c-Myc overexpression of both 
mRNA and protein is related to the copy number of the c-myc 
DNA amplification. We show in this study that amplification and 
overexpression of c-Myc occur with high frequency in high-grade 
human breast cancer tissues. 



MATERIALS AND METHODS 
Materials 

Formalin-fixed, paraffin-embedded tissue blocks of breast carci- 
noma and normal breast tissue were obtained from the 
Histopathology and Tissue Shared Resource at the Lombardi 
Comprehensive Cancer Center (LCCC), at Georgetown University 
Medical Center. The criteria for tumour selection were the 
following: negative progesterone receptor status, metastases to 
auxiliary lymph nodes and high grade (Elston Score >7). The 
oestrogen receptor status of the tumours was known from archived 
pathology reports. The parameters were chosen from our prior 
meta-analysis (Deming et al, 2000), as indications of a high 
likelihood of c-myc gene amplification. Normal breast tissue 
specimens were from reduction mammoplasty. Serial sections 
(5 /cm) for FISH, ISH and IHC were prepared by the LCCC 
Histopathology and Tissue Shared Resource. 

FISH 

A dual-label FISH technique was used (Jenkins et al, 1997). Slides 
were baked overnight at 60°C to assure adherence of the sample. 
Tissue sections were deparaffinised with two successive, lOmin 
xylene washes, and then dehydrated in a graded ethanol series of 
70, 80 and 95% at room temperature. Samples were then digested 
with 4% pepsin (Sigma, St Louis, MO, USA) at 45°C for lOmin. 
DNA probes used were an alpha satellite probe to chromosome 8, 
labelled with biotin, and a c-myc probe, labelled with digoxigenin 
(Ventana, Tucson, AZ, USA). Codenaturation was performed at 
90° for lOmin on a hot plate. Hybridisation was at 37°C for 12- 
16 h. Detection of signals was accomplished with an antiavidin 
antibody labelled with Texas Red, and an antidigoxigenin antibody 
conjugated to fluorescein (Ventana, Tucson, AZ, USA). Slides were 
postwashed in 2 x SSC at 72°C for 5 min and counterstained with 
DAPI to visualise cell nuclei. Results were viewed and quantified 
with a Zeiss Axiophot fluorescence microscope, equipped with 
appropriate filters and an Applied Imaging Cytovision system 
(Pittsburgh, PA, USA). In this approach, the c-myc unique 
sequence probe was visualised as a green signal and the control 
probe for the chromosome 8 centromere was red, thus easily being 
distinguished when scored. 

One serial section from each tumour sample was stained with 
haematoxylin and eosin and first reviewed by a pathologist (BS), to 
help identify the tumour area of the section. This procedure 
ensured that the tumour cells, but not the normal cells, were 
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counted. Nuclei of up to 50 tumour cells were scored from each 
FlSH-stained section, independendy by two investigators. Hybri- 
disation signals were averaged, and the amplification index was 
presented as the number of c-myc signals divided by the number of 
chromosome 8 centromere signals. A 1.8-fold increase was used as 
the criterion to judge the presence of c-myc gene amplification. 

In situ hybridisation 

In situ hybridisation (ISH) was carried out with a nonradioactive 
method, described previously (Liao et al, 2000a, b). One serial 
section from each specimen was hybridised overnight at 60°C with 
riboprobes, that were in vitro transcribed from the antisense or 
sense strand of an approximately 300 bp cDNA of human c-myc 
(ATCC, Manassas, VA, USA), labelled with digoxigenin-conjugated 
UTP. The sections were then incubated with an antibody against 
digoxigenin, followed by incubation with a second antibody 
conjugated to alkaline phosphatase. The signal was visualised by 
colour development with 5 bromo-4-chloro-3-indolyl phosphate 
and nitroblue tetrazolium. All reagents were purchased from 
Boehringer Mannheim, Indianapolis, IA. To control the signal 
specificity, two serial sections were mounted on the same slide for 
hybridisation with the antisense and sense probes, respectively. 
ISH was given an intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. Intensity scores were assigned 0, 1, 2 and 3, and 
percentage scores were assigned as 1- 1-25, 2- 26-50, 3- 51-75 
and 4- 76-100%. 

Immunohisto chemistry 

Immunohistochemical staining (IHC) was performed using an 
avidin-biotin complex (ABC) method described previously (Liao 
et al, 1998). One serial section of each specimen was deparaffinised 
and blocked with 3% peroxide. Antigens were retrieved by heating 
slides in a microwave oven in 50 mM citrate buffer, pH 6.4, at 
boiling temperature, for 12min. After blocking with 6% normal 
goat serum, the section was incubated with a mouse monoclonal 
antibody to human c-Myc (9E10, Sigma Chemical Company, St 
Louis, MO, USA) at 1:100 dilution for 2h, followed by lh 
incubation with a second antibody conjugated with biotin (Vector 
Laboratories Inc., Burlingame, CA, USA). The section was then 
incubated with peroxidase-conjugated avidin (Dako, Corporation, 
Carpinteria, CA, USA) for 30 min, followed by colour development 
with diaminobenzidine and peroxide. All procedures were carried 
out at room temperature. To control the signal specificity, serial 
sections from 10 tumour samples were also stained using an 
alternate c-Myc antibody (CI 9 from Santa Cruz Biotechnology 
Inc., Santa Cruz, CA, USA) at 1 : 60 dilution. This antibody resulted 
in focally positive staining in the tumour, but the staining intensity 
was weaker. To control the signal specificity, serial sections were 
made from five selected positive cases which were subjected to the 
same staining procedure, with a normal mouse IgG to replace the 
c-Myc antibody. This control staining did not give rise to a signal, 
demonstrating the specificity of the c-Myc antibody signal. IHC 
staining was given an intensity and percentage score based upon 
the intensity of positive staining and number of cells staining. 
Intensity scores were assigned 0, 1, 2 and 3 and percentage scores 
were assigned as 1- 1-25, 2- 26-50, 3- 51-75 and 4- 76-100%. 
Determinations were made of cellular localisation of c-Myc 
antibody staining to cytoplasm and/or nucleus in normal and 
invasive cells within each breast tumour specimen. 

Statistical analyses 

For each analysis of gene copy amplification (FISH), mRNA 
expression (ISH) and protein expression (IHC), all cases were first 
grouped as positive or negative to calculate the percentages of 



positive cases and negative cases, as described (Zar, 1974). Fisher's 
exact test was used to compare percentages, and two-sample /-test 
or Wilcoxon rank test was used to compare average scores. Both 
ISH and IHC were given intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. As noted earlier, intensity scores were assigned 0, 1, 
2 and 3 and percentage scores were assigned as 1- 1 -25, 2- 26-50, 
3- 51-75 and 4- 76-100%. A score of >2 for either intensity of 
staining or percentage of cells positive by ISH was assigned as 
high. For IHC, an intensity score of > 1 was assigned as high and a 
percentage score of >3 was categorised as high. Each amplifica- 
tion index was paired with its corresponding mRNA expression 
score to calculate the coefficient r. The same method was used to 
estimate the association of the amplification indices with the c- 
Myc protein expression levels, and the association of the mRNA 
expression levels with the protein expression levels. A P-value of 
0.05 or less was used to determine the statistical significance in all 
analyses. In all, 54 pairs of normal vs invasive tissues were analysed 
using McNemars x 2 test to determine if there was a difference in 
cellular localisation of c-Myc antibody signal to nuclear or 
cytoplasmic compartments. 



RESULTS 

FISH analysis of gene amplification 

Amplification of the c-myc gene was measured by a FISH test in 46 
cases of breast cancer; Figure 1 demonstrates cells with no 
amplification (one copy of c-myc /one copy of chromosome 8 
centromere, and a moderate amplification a 3/1 ratio). Amplifica- 
tion was calculated by the number of c-myc signals divided by the 
number of chromosome 8 alpha satellite signals. A 1.8-fold 
increase cut-off was used to judge gene amplification. As shown 
in Table 1, 32 out of 46 (70%) cases were gene amplified for c-myc, 
whereas only 30% (14/46) of the cases showed amplification 
indices lower than the cut-off value. The amplification indices for 
most (84%, or 27/32) cases with gene amplification, ranged 
between 1.8- and three-fold, indicating that the locus gained up to 
two copies of c-myc in the majority of the cases. The percentage of 
cases with gene gains of three copies or higher was 1 1% (five out of 
46) of total cases analysed, or near 16% (five out of 32) of the cases 
with gene amplification, including one case (2% of total cases or 
3% of the cases with gene amplification) with the highest index of 5 
(a gain of four copies). 

In all, 28 of the breast carcinomas in this study were ER 
negative, and 14 were ER positive. The average c-myc gene 
amplification score was 1.896 (s.e. = 0.196) for ER positive and 
2.201 (s.e. = 0.157) for ER negative. Although ER-negative tumours 
had a slightly higher average c-myc score, the difference was not 
statistically significant (two-sided P= 0.252 from two-sample Mest 
and 0.251 from Wilcoxon rank test), consistent with the results of 
our prior meta-analysis of the literature (Deming et a/, 2000). 

in situ hybridisation analysis of c-myc mRNA expression 

A total of 5 1 breast cancer samples were studied for c-Myc mRNA 
expression, with non radioactive in situ hybridisation (ISH). ISH 
results were assigned intensity and percentage scores based upon 
signal intensity of positive staining and number of celts staining 
within the sample, respectively. As shown in Table 2, 86% (44 out 
of 51) tumours were scored as high in intensity, and 92% (47out of 
51) had more than 51% positive cells, also considered as highly 
increased c-Myc expression. mRNA expression was heterogeneous 
in the breast tumour tissue, and no morphologic subtype was 
predominant in the high or low categories. One case showed no c- 
Myc ISH staining. In 79% (38/48) of cases, epithelia in normal 
mammary glands adjacent to the tumour also showed a high 
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Figure I RSH analysis of c-myc amplification in tumour cells from breast tumour tissue sections. FISH probe for human c-myc unique-sequence is seen as 
green, white the normal control signal* a centromeric probe signal for chromosome 8 is shown in red. The nuclei of tumour cells were visualised by DAPI 
counter-staining, (A) 1:1 copy ratio of c-myc to chromosome 8 (c-myc/8 centromere), indicating no amplification of c-myc in tumour cells. (B) 1:3 copy 
ratio of c-myc to chromosome 8 (c-myc/8 centromere), a moderate amplification of the c-myc gene. 



Table I c-myc gene copy amplification analysis by RSH in poor 
prognosis human breast tumour samples 



Amplification index 




(#c-myc signals/ft 


Percentage of samples with 


control signals)* 


FISH ratios in each category 


IX)- 17 


30% 14 out of 46 


1.8-1.99 


20% Nine out of 46 


2.0-2.9 


3956 18 out of 46 


>3.0 


11% Five out of 46 



Analysis was conducted on 46 individual paraffin-embedded tissue samples with 
negative progesterone receptor status, positive lymph node involvement and high 
tumour grade. + Normal control ratio is I. 



intensity of staining. In three cases, no staining was seen in the 
normal terminal duct lobular units. Figure 2 shows representative 
fields of high, medium and low c-myc mRNA expression levels in 
invasive ductal carcinoma samples. 

Association of FISH and ISH 

c-Myc scores were dichotomised as binary variables (high or low), 
and a score of 2 or higher was categorised as high on ISH. A score 
higher than median was categorised as high from PISH studies. 
These dichotomised scores are depicted in Table 3. A Fisher's exact 
test was performed for comparing binary responses to see if there 
was any association between FISH and ISH. It was found that the 
FISH score was significantly associated with percentage of staining 
in the invasive cells (P = 0.0067, two-sided McNemar's test) and 
also with the intensity score on ISH (P = 0.0006, two-sided). 



Immunohistochemicai staining of c-Myc proteins 

In total, 51 breast carcinomas, which were subjected to FISH 
analysis, and all of which also had been analysed for c-myc mRNA 
by in situ hybridisation, were also analysed for the expression of c- 
Myc protein, using immunohistochemicai staining with the 9E10 
antibody. IHC results were assigned an intensity and percentage 
score based on intensity of positive staining and number of cells 
staining, respectively. Intensity scores were assigned 0, 1, 2 and 3 
and percentage scores were assigned as 0, 1- 0-25, 2- 26-50, 3- 



51-75 and 4- 76-100. For IHC, an intensity score of > I was 
assigned as high and a percentage score of > 3 was categorised as 
high. Figure 2 shows examples of high, medium and low levels of c- 
myc antibody staining in invasive ductal carcinoma samples. In 34 
cases, normal tissue was seen; 30 of these showed cytoplasmic 
staining and 22 had nuclear staining in terminal ductal lobular 
units. In all, 12 cases showed 1 + , 14 cases 2 + and four cases 3 + 
cytoplasmic staining. In situ hybridisation revealed positive 
staining in 46 out of 49 cases with normal tissue. Seven cases 
showed 1+, 13 cases showed 2+ and 26 cases showed 3 + 
staining by ISH. Both irnmunohistochemistry and in situ 
hybridisation showed diffuse positivity in adipocytes. 

Table 4 shows the staining pattern for the cohort In all, 70% (36 
out of 51) of cases showed high intensity of staining for c-Myc 
protein, while 85% (29 out of 34) of cases with detectable staining 
had more than 76% positive cells, also considered as high 
expression. To verify the staining specificity, serial sections from 
10 tumour specimens that were positive for 9E10 antibody were 
also stained using the CI 9 rabbit polyclonal ant i -c-Myc antibody. 
Results revealed a staining pattern similar to 9E10. However, the 
staining intensity with C19 was weaker than 9E10. The specificity 
of these two antibodies was verified by Western blots in previous 
studies (Persons et al t 1997; Liao et al, 2000b). Figure 2 shows 
results of c-Myc in situ hybridisation and irnmunohistochemistry 
studies on samples considered to demonstrate low, moderate and 
high levels of c-Myc expression. Analysis of c-Myc protein 
localisation results in the nucleus or cytoplasmic compartments 
of normal and invasive cells within the tumours revealed that 
nuclear staining was positive in 41% of normal cells, compared to 
22% of invasive cells (statistical significance at P=0.01 by 
McNemar's two-sided % 2 test). The increase in relative cytoplasmic 
localisation of c-Myc protein, comparing normal (53.7%), to 
invasive cells (61.1%) was not significantly different. Thus, the data 
are consistent with partial exclusion of c-Myc from the nuclei of 
invasive breast cancer cells. 

The FISH score was significantly associated with the percentage 
positivity of invasive cells, as seen on IHC studies of c-Myc. 
However, 40% of tumours displayed a low index of c-wyc gene 
amplification, but still expressed high levels of c-Myc protein 
(Table 6), indicating the possibility of other mechanisms of over 
expression unrelated to gene amplification in at least some 
tumours. The FISH score was not significantly associated with 
the intensity of IHC staining in the invasive cells (not shown), in 
contrast to the IHC percentage positivity score. 
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Table 2 c-myc mRNA in s/tu hybridisation (ISH) results 



Staining intensity 


0 12 3 Percent positivity 


12 3 4 


Number of tumour samples in each category N = 5 1 


1 6 25 19 Number of tumour samples in each level category SI 


1 1 5 42 



In all S I human high-grade breast carcinomas were analysed to determine the relationships between c*Myc mRNA expression and c-myc gene in situ hybridisation results. Data 
are shown in two ways in the above table, first overall staining intensity of c-Myc-positwe cells was scored as 0, 1, Z 3 (low to high), and the number of tumour samples at each 
level of staining indicated on the line below. Next, the percentage of tumour cells staining was scored as a I. 2 3. 4 (low to high %. as discussed in Materials and Methods). The 
number of tumours at each level of percent cell positivity for c-Myc is then indicated on the fine below. 




DISCUSSION 

Although there have been many reports on c-myc amplification in 
human breast cancer (Liao and Dickson, 2000), there are only two 
published studies involving application of the FISH technique to 
unfixed, frozen sections (Persons et aU 1997; Visscher et al, 1997), 
and one prior study using FISH on an archival human tissue 
microarray (Schraml et al, 1999). Another recent study applied 
FISH to evaluate c-myc amplification in ductal carcinoma in situ 



(DCIS) (Aulmann et at, 2002). Using the FISH technique on 
formal in- fixed, paraffin-embedded sections, we now show that 
70% of high-grade breast cancer samples bear c-myc gene copy 
amplifications. Interestingly, the above-mentioned study, using 
FISH and focusing on DCIS, detected amplification of c-myc in 
only 20% of cases, but found a correlation of c-myc with increased 
tumour size and proliferation (Aulmann et al, 2002). 

The level of amplification of c-myc in our study ranged between 
one and four additional copies of the gene; the majority (84%) of 
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the cases with the gene amplification gained only one to two 
copies, also consistent with FISH data reported for c-myc copy 
amplification in human metastatic prostate carcinoma tissues 
(Jenkins et ah 1997). The relationship between the level of c-myc 
gene copy amplification and the level its increased mRNA 
expression has been examined previously in breast cancer cell 
lines (Hyman et ah 2002). In general, it has been concluded that 
the two scores coordinate for c-myc, as is the case for many breast 
cancer genes. However, only 44% of the highly amplified genes, in 
general, showed increased RNA expression, and only 10.5% of the 
highly overexpressed genes were gene copy-amplified in the cell 
line study (Hyman et ah 2002). Another analysis was conducted to 
study of relationships between gene amplification and expression 
of 6095 genes in 37 intermediate grade human breast tumours. 
This study demonstrated that 62% of the highly amplified genes 
also showed elevated expression; overall, a two-fold change in 
DNA copy number was associated with a 1.5-fold change in mRNA 
levels. Overall, 12% of the variation in gene expression in the 
breast tumours studied was associated with gene copy number 
variation (Pollack et ah 2002). Further study of additional human 
breast tumours, at precisely defined grades and stages, will be 
necessary in order to more fully define the relationships between 
DNA copy numbers and expression of genes. The studies we report 
here indicate higher levels of c-Myc gene amplification and 
expression, than other previous reports in breast cancer. We 
believe that this is probably the result of our analysis of individual 
tumour cells in a well-defined set of high-grade breast tumours. 
Prior c-Myc expression and amplification microarray studies used 
tumour specimens which contain normal stromal components, 



Table 3 Correlations between c-myc gene copy number (RSH) mRNA 
expression (ISH) 

FISH 



Low 



High 



(A) ISH (56 cells) 

Low 

High 



(B) ISH (intensity) 

Low 

High 



/ 

19 



2 

18 



3 

id 

P = 0.0067 



5 
(6 

P = 0.0006 



Serial sections of high-grade human breast carcinomas were scored for c-myc gene 
copy number (RSH. Table I) and mRNA expression (ISH, Table 2). In (A), a positive 
correlation (P = 0.0067) was observed between tumour samples with a high 
percentage of cells demonstrating mRNA expression and a high c-myc gene copy 
number. A score of 2 or higher was classified as high on ISH, and a score of median or 
greater was categorised as high on FISH. In (B). a positive correlation (P — 0.0006) 
was shown between a high level of intensity for c-Myc RNA expression a high and c- 
myc gene copy number. Note that a pairwise comparison of FISH and ISH was not 
possible for all cases, due to incomplete overlap of cases analysed with each assay. 



potentially underestimating amplification and expression levels of 
the invasive tumour components (Pollack et ah 2002). 

Our study reports a percentage of tumours gene amplified for c- 
myc (using FISH in high-grade tumours) that is much higher than 
the average figure (15.5%) reported in the literature (Isola et ah 
2002). Most of the prior studies have employed the relatively 
insensitive Southern blot technique* and were reviewed in a 
recent meta-analysis (Deming et ah 2000). Consistent with this 
prior literature background* a recent study of 94 lobular and 
ductal breast cancers assessed amplification of c-myc by 
using a semiquantitative PCR assay and protein expression, with 



Table 5 Nuclear/cytoplasmic localisation of c-Myc comparing normal 
and invasive cells 



1617 



Normal cells 
(frequency percent) 


Invasive cells 
(frequency percent) 




Total 


(A) Nuclear localisation 










28 


+ 
4 


32 


+ 


14 


8 


22 


Total 


42 


12 


54 


(B) Cytoplasmic localisation 










12 


13 


25 


+ 


9 


20 


21 


Total 


21 


33 


54 



In all, 54 pairs (normal vs invasive) of tissues were analysed to answer the questions of 
(I) whether positivhy of nuclear ceHs in normal tissues is different from thai in 
invasive cells, and similarly (2) whether positivity of cytoplasmic cells in normal tissues 
is cfifferent from that in invasive cells. The data are summarised in the above 
contingency tables. In all, 22 normal cell specimens were positive for c-Myc staining 
(4071%). compared to 12 specimens (22.2%) in invasive cells. The difference is 
statistically significant (P=0l0I) by McNemars x 2 test (two-sided). 



Table 6 Correlation between c-Myc protein expression (IHC) and c- 
myc gene copy number (RSH) 



IHC (% ceRs) 




FISH 




Low 




High 


Low 


3 




0 


High 


10 




15 








P = 0.00 16 



Consecutive serial sections of high-grade human breast tumours were scored for o 
myc gene copy number or protein expression, by immunohistochemistry (IHC). IHC 
scores were defmed in the Materials and methods section. Data were analysed for 
correlations between the results. A highly significant correlation was observed 
between high c-Myc protein expression (IHC) between percent cells positive and 
high c-myc gene amplification (FISH). P- 0.00 1 6 from two-sided McNemar's test 
Note that for IS cases, no staining for c-Myc could be detected: these negative cases 
were not included in the correlation presented, above. 



Table 4 c-Myc immunohistochemistry (IHC) results 



Staining intensity 


0 


1 


2 


3 


Percent positivity 


1 


2 


3 


4 


Number of tumour samples in each category 


15 


13 


20 


3 


Number of tumour samples in each category 


2 


2 


1 


29 



In all. 5 1 high-grade human breast carcinomas were analysed to determine the relationships between c-Myc protein expression and c-myc gene in situ hybndisalion results Data 
are shown in two ways in the above table. First overall staining intensity of c-Myc-posittve cells was scored as 0. 1, 2. 3 (low to high), and the number of tumour samples at each 
level of staining is indicated on the line below. Next, in a random subset of these cases, the percentage of tumour cells staining was scored as 0. I. 2, 3. 4 (low to high %. as 
discussed in Matenals and methods). The number of tumours at each level of percent cell positivity for c-Myc is indicated on the line below. 
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densitometry, after Western blot. These data showed c-myc gene 
amplification in 21% of tumours (Jenkins et al, 1997), using assays 
not based on in situ discrimination of tumour v$ nontumour cells. 
The lower frequency of c-myc in this prior study is in contrast with 
the data we present here, and could be the result of the higher 
sensitivity and precision of the FISH and immunohistochemical 
methods, as distinct from quantitative PGR and Western blot 
densitometry. In addition, the 70% of amplified tumours in our 
study is also much higher than the 12% reported by Schraml et al 
(1999), using a c-myc FISH test on a tissue microarray. This large 
difference may be because the arrays are prepared from cores of 
paraffin-embedded tissue, as small as 0.6 mm in diameter which 
may contain too few tumour cells for complete analysis of 
amplification of a gene, such as c-myc. c-myc is known to be 
quite heterogeneous in its gene amplification within individual 
tumours (in contrast to HER2/neu, for example) (Persons et al> 
1997). 

Most previous reports on the expression of c-myc mRNA have 
utilised Northern blot, dot blot or PCR-based approaches, while 
just a few involved in situ hybridisation, which were primarily 
performed on frozen tissue sections (Liao and Dickson, 2000). 
Normal breast tissue is dominated by adipose cells, differing 
greatly from tumour tissue in its epithelial cellularity. Thus, 
normal and tumour tissues may not be rigorously compared by 
techniques involving RNA extraction from total tissue. Therefore, 
conclusions such as 'increased expression 1 may be more difficult to 
make from studies with Northern blot, dot blot and PCR-based 
techniques that require RNA extraction from tissues that have not 
been fastidiously micro-dissected for selection of tumour cells. 
Using a more sensitive, nonradioactive in situ hybridisation (ISH) 
approach on formalin-fixed, paraffin-embedded sections, we 
report herein high expression of c-myc mRNA in 92% of high- 
grade breast carcinomas. This figure is much higher than the 
recently reported data (22%), obtained by using a real-time RT- 
PCR method (Bieche et al, 1999). Dilution of the RNA from 
epithelium by the RNA from adipose in normal breast tissue in this 
latest prior report may be one of the possible explanations for this 
large difference. 

In conclusion, the present study shows that approximately 70, 92 
and 70% of biopsies of untreated high-grade breast cancer exhibit 
c-myc gene amplification, mRNA overexpression and protein 
overexpression, respectively. In most cases (84%), with gene copy 



amplification, the c-myc gene gains one to two additional copies, c- 
myc gene amplification was significantly associated with expres- 
sion of its mRNA (both by intensity in invasive cells and by 
percentage positivity in invasive cells), and with expression of its 
protein (by percentage positivity in invasive cells). However, our 
data were also consistent with the prior literature on c-Myc 
(reviewed in Nass and Dickson, 1997; Liao and Dickson, 2000), 
indicating complex transcriptional, post transcriptional, transla- 
tional and post-translational control of c-Myc expression in vitro. 
Specifically, in Table 5 we observed that in 40% of the high-grade 
tumours tested, c-Myc protein was expressed at high levels, despite 
a lack of its gene amplification. 

It will be interesting to analyse lower grade tumours and 
premalignant lesions, with the same measurement tools, to 
determine if this c-myc amplification pattern is different, 
comparing different steps in onset and progression of the 
disease. Specifically, prior studies in fibroblasts and in human 
mammary epithelial cells (Liao et al, 1998, 2000a, b) have 
demonstrated that only a subtle deregulation of expression of c- 
Myc is sufficient to allow genomic instability. These prior cell 
biologic findings raise the question of whether c-Myc protein 
expression precedes or follows its gene amplification during the 
course of die natural history of breast cancer. It will also be 
interesting for future studies of lower grade breast cancers and 
premalignant lesions to determine whether there is evidence of 
nuclear exclusion of c-Myc protein. Indeed, nuclear exclusion 
of c-Myc in high-grade tumours could serve to attenuate its 
functions in later stages of disease progression (Liao and Dickson, 
2000). 
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jjstjacts. If th ese nrin° r ceU proteins differ among cells to the same extent as the 
ivjjjQre abundant proteins, as is commonly assumed, only a small number of pro- 
nto differences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization— different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression of different sets of 
genes. 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 4 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a cell can control the proteins it 
makes by (1) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport con- 
trol), (4) selecting which mRNAs in the cytoplasm are translated by ribosomes 
(translational control), (5) selectively destabilizing certain mRNA molecules in 
toe cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9-2). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 9-2, only transcrip- 
tional control en$ures that no superfluous intermediates are synthesized. In the 




Figure 9-2 Six steps at which 
eucaryote gene expression can be 
controlled. Only controls that operate 
at steps 1 through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
degradation. 
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following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction ofits genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control 



DNA-binding Motifs in Gene 
Regulatory Proteins 5 

How does a cell determine which of its thousands of genes to transcribe? As dis- 
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 



Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics 6 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in protfeins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a series of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 



Figure 9-3 Double-helical structure 
of DNA. The major and minor grooves 
on the outside of the double helix are 
indicated. The atoms are colored as 
follows: carbon, dark blue; nitrogen 
light blue; hydrogen, white; oxygen 
red; phosphorus, yellow. 
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Figure 9-71 A mechanism to explain 
both the marked deficiency of CG 
sequences and the presence of CG 
islands in vertebrate genomes. A 
black line marks the location of an 
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DNA sequence, while a red line marks 
the location of a methylated CG 
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Summary 

the many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe- 
cialized animal cells can maintain their unique character when grown in culture, the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
lished and heritable when the cell divides, endowing the cell with a memory of its 
developmental history. Procaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types in higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other, this can create a flip-flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rector indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

In eucaryotes gene transcription is generally controlled by combinations of gene 
regulatory proteins. It is thought that each type of cell in a higher eucaryotic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by eucaryotic cells to regulate gene expression. In ver- 
tebrates DNA methylatipn also plays a part, mainly as a device to reinforce decisions 
about gene expression that are made initially by other mechanisms. 



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant 
form of regulation for most genes, other controls can act later in the pathway 
from RNA to protein to modulate the amount of gene product that is made. Al- 
though these posttranscriptional controls, which operate after RNA polymerase 
jjas bound to the gene's promoter and begun RNA synthesis, are less common 
™n transcriptional control, for many genes they are crucial. It seems that every 
step to gene expression that could.be controlled in principle is likely to be regu- 
ec * under some circumstances for some genes. 

We consider the varieties of posttranscriptional regulation in temporal or- 
er ] acc °rding to the sequence of events that might be experienced by an RNA 
°tecule after its transcription has begun (Figure 9-72). 
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Figure 9-72 Possible post- 
transcriptional controls on gene 
expression. Only a few of these 
controls are likely to be used for any 
one gene. 
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Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A Is 
transcribed and ^translated much more 
efficiently than gene B.This allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 
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FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment-— most obviously by controlling 
the production of its RNA. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in twtf respects: (1) the nucleotides in RNA are 
ribonucleotides — that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA; for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change En a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the self-Infectious nature of the protein aggregation that Is central to prion diseases. PrP is highly unusual 
because the misfolded version of the protein, called PrP*, induces the normal PrP protein it contacts to 
change its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure Is 
not Infectious In this way, it cannot spread from one animal to another. (C) Drawing of a cross-beta filament, 
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions In a JJ sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the likely change of two a-helices into four 
p-stj-ands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpelt, adapted from M. Sunde et aLJ. Mot Wot 273:729-739, 
1997; D, adapted from S.B. Prusiner, Trends toochem. Sci. 21:482-487, 1996.) 



animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease 0 ) from cattle to humans in Great Britain. 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation! a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance" observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein in a eucaryotic cell 
depends upon the efficiency of each step 
depicted. 
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ure 6-90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl iRNAs—each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodon. Each amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled. Controls that operate at 
steps I through 5 are discussed ip .this 
chapter. Step 6, the regulation of protein 
activity, includes reversible activation or 
inactlvation by protein phosphorylation 
(discussed in Chapter 3) as well as 
irreversible inactlvation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various ceU types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and determining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5) . 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 



Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple be regulated, for most genes the initiation of RNA transcription is the most 
. important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
trolled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
oy a single signal. Many others are complex and act as tiny microprocessors, 
.responding to a variety of signals that they interpret and integrate to switch the 
^ghboring gene on or off. Whether complex or simple, these switching devices 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters of genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired. Such regions are preserved in modem day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscrlptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
many genes they are crucial. 
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Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in vertebrate 
genomes. A black fine marks the location 
of a CG dinudeotide in the DNA 
sequence, while a red "lollipop" indicates 
the presence of a methyl group on the 
CG dinudeotide. CG sequences that lie in 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained In 
evolution. Methylated CG sequences, on 
the ojiier hand, tend to be lost through 
deamination of 5-methy! C toT, unless the 
CG sequence is critical for survival. 
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CHAPTER 29 



Regulation of transcription 



X\st phenolypic dinVrenccs lliai distinguish the 
various kinds nr cells in a higher trufcnryute are 
lAigetv due to" cJiITerernctrs in the expression of 
, f nes that code for proteins. that is. those tran- 
scribed hy HNA polymerase II. In principle, the 
fXpression of these {seiu'5 might he regulated al 
all v owe or several .singes. The concept or the 
-level or conlroP implies Hint gene expression 
' is pol necessarily an automatic process once il 
has begun- It could be regulated in a gene- 
5peciHc way al any one of several sequential 
steps. We can (listing tiislt <at least) five poteu- 
tial control points, forming the series: 

Activation or £t?m* struruuv 
1 

Initiation or transcription 
i 

Processing the transrripj 
I 

Transport to evtoplasm 
1 

Translation of mRSA 

The existence of the first slep is implied hy 
Hie discover? that genes may exist in either of 
two structural conditions. Helailve In ihe stale 
of most of the genome, genes are found in 
nu -active" state in the cells In which they 
are expressed (see Chapter The change of 
structure (S. distinct from the act of transcrip* 
lion, and indicates that the gene is transcribe 
, able." This suggests that acquisition of the 
! •active' structure musl be the first step in gene 
! expression. 

j Transcription of a gene in the active state is 



controlled at the stage of initiation, lliai Is. by 
the interaction of UNA polymerase with its pro- 
moter. This Is now becoming susceptible to 
analysis in the in vitro systems (ser Chapter 
2HJ. For most genes, this Is a mainr control 
point; probably it is ttie most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eukary- 
otic cells, for example* via ant tie rmi nation 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end. and usually a?so hy polyadenyla- 
tion at the 3' end. Intro ns must \k spliced out 
frum the transcripts of interrupted genes. The 
mature AM must he exported renin the nucleus 
to Hie cytoplasm. Regulation or gene expression 
by selection of sequences nl the lc%ef of nuclear 
UNA might involve any or all of these stages, 
hut the one foe which we have most evidence 
concerns changes in splicing: some genes are 
expressed hy means uf alternative splicing pat* 
terns whose regulation controls the type or pro- 
tein product (see Chapter :«>). 

Finally, the translation of no mKNA In the cyto- 
plasm can he specifically controlled. There is little 
evidence for the employment of this mechanism in 
a dull somatic cells, but U does occur in some 
embryonic situations, as described in Chapter 7. 
-The mechanism is presumed to involve the block- 
ing of Initiation of translation or some mRNAs by 
specific protein factors. 

But having acknowledged that control of gene 
expression can occur at multiple stages, and 
that production of RNA cannot inevitably be 
equated with production of protein, it Is clear 
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that the overwhelming majority of regulatory 
events occur al lhe initiation or transcription. 
Regulation of tissue-specific gene transcription 
ties al the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins lhat regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves lo provide 



common control of a large number of largei 
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity or the transcrip- 
tion factor itself regulated in response to imrin 
sic or extrinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups or genes under common control is that 
they share a promoter clement that is recognized 
by a regulator}' transcription factor. An element 
that causes a gene to respond to such a factor 
is colled a response element; examples are the 
HSE (heat shock response element), GRE 
(glucocorticoid response element), SRE (scrum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in. Table 29.1. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
TTiey contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
extends for- a short distance on either side of 



Table 29.1 f ncucitfe transcnpi.on Jacicrs bird tc 
response etumsnls that icfcnt.fy groups cJ pcmclers 
or cihancsrs subject lo coordinate ccr.tr cl. 



Regulatory Agent Module Consensus 



Factor 



He* Short HSE CWNGAANNTCCNNQ HSTF 

Glucocorticoid GRE TGGTACMATGnCT Receptor 

PhOfbol aster TOE TGACTCA APt 

Serum SRE OCATATTAGG SflF 



the consensus sequence, tn promoters, the ele- 
ments are not present at fixed distances from 
the startpoint, but are usually <200 Up upstream 
of It- The presence of a single element usually 
Is sufficient to confer the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in pf°" 
moters or in enhancers. Some types of eteoie" 15 
are typically found in one rather than the othr r 
usually an HSE is found in a promoter, *hil* a 
GRE is found in an enhancer. We assume th>* 
all response elements function by the *> ntf 
general principle. A gene is regulated fc 1 ' 0 
sequence at tlic promoter or enhancer thai * 
recognized by a specific protein. The P* 0 **!*, 
Junctions as a transcription factor needed ™ 
UNA polymerase to initiate.' Active protein * 
available onfy under conditions when the, 
to be expressed; its absence means that tht r** 
mater- is not activated by this particular <x reU ' | 

An example of a situation in which ^ 
genes are controlled by a single factor & J*\ 
vided by the heat shock response. This ^ 
mon lo a wide range of prokaryotes ^ 
eukaryoles and involves multiple co n,ro ^ ||f r 
gene expression; an increase in temp** 1 fl(t 
turns off transcription of some genes. wlf,s ^ 
transcription of the heat shock 
causes changes in the translation of ^ 
The control or the heal shock genes ^ 
the differences between *prokaryo lic „ 
eukaryotic modes of contmL In. bacteria, * 
Sigma factor is synthesized that ^ TtCl$ $ ^ef 
polymerase holoenzyme to recognize « n - 
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Background: Prostate stem cell antigen (PSCA) is a recently defined homologue of the Thy-i/Ly-6 family of 
gtycosylphosphatidylinositol (GPI)-anchored cell surface antigens. The purpose of the present study was to 
examine the expression status of PSCA protein and mRNA in clinical specimens of human prostate cancer (Pea) 
and to validate it as a potential molecular target for diagnosis and treatment of Pea. 

Materials and Methods: tmmunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 benign prostatic hyperplasia 
(BPH), 20 prostatic intraepithelial neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was semiquantitatlvery scored by assessing both the 
percentage and Intensity of PSCA-positive staining cells in the specimens. Then compared PSCA expression 
between BPH, PIN and Pea tissues and anafysed the correlations of PSCA expression level with pathological grade, 
clinical stage and progression to androgen-independence in Pea. 

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative and less intense 
and uniform than that seen in HGPIN and Pea. There were moderate to strong PSCA protein and mRNA 
expression in 8 of 1 1 (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by IHC and ISH analyses, 
with statistical significance compared with BPH (20%) and low grade PIN {22.2%) samples (p < 0.05, respectively). 
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen- 
independence (p < 0.05, respectively). In addition, IHC and ISH staining showed a high degree of correlation 
between PSCA protein and mRNA overexpression. 

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of 
human Pea. PSCA expression correlates positively with adverse tumor characteristics, such as increasing 
pathological grade (poor ceil differentiation), worsening clinical stage and androgen-independence, and 
speculatively with prostate carcinogenesis. PSCA protein overexpression results from upregulated transcription 
of PSCA mRNA. PSCA may have prognostic utility and may be a promising molecular target for diagnosis and 
treatment of Pea. 
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Introduction 

Prostate cancer (Pea) is the second leading cause of can- 
cer-related death in American men and is becoming a 
common cancer increasing in China. Despite recently 
great progress in the diagnosis and management of local- 
ized disease, there continues to be a need for new diagnos- 
tic markers that can accurately discriminate between 
indolent and aggressive variants of Pea. There also contin- 
ues to be a need for the identification and characterization 
of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and 
metastatic Pea have been limited by a lack of specific tar- 
get antigens of Pea. 

Although a number of prostate-specific genes have been 
identified (i.e. prostate specific antigen, prostatic acid 
phosphatase, glandular kallikrein 2), the majority of these 
are secreted proteins not ideally suited for many immuno- 
logical strategies. So, the identification of new cell surface 
antigens is critical to the development of new diagnostic 
and therapeutic approaches to the management of Pea, 

Reiter RE et al [1] reported the identification of prostate 
stem cell antigen (PSCA), a cell surface antigen that is pre- 
dominantly prostate specific. The PSCA gene encodes a 
123 amino acid glycoprotein, with 30% homology to 
stem cell antigen 2 (Sea 2). Like Sea- 2, PSCA also belongs 
to a member of theThy-l/Ly-6 family and is anchored by 
a glycosylphosphatidyl inositol (GPI) linkage. mRNA in 
situ hybridization (ISH) localized PSCA expression in nor- 
mal prostate to the basal cell epithelium, the putative 
stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor 
cells. 

In order to examine the status of PSCA protein and mRNA 
expression in human Pea and validate it as a potential 
diagnostic and therapeutic target for Pea, we used immu- 
nohistochemistry (IHC) and in situ hybridization (ISH) 
simultaneously, and conducted PSCA protein and mRNA 
expression analyses in paraffin-embedded tissue speci- 
mens of benign prostatic hyperplasia (BPH, n « 20), pros- 
tate intraepithelial neoplasm (PIN, n = 20) and prostate 
cancer (Pea, n = 48). Furthermore, we evaluated the possi- 
ble correlation of PSCA expression level with Pea tumori- 
genesis, grade, stage and progression to androgen- 
independence. 

Materials and methods 
Tissue samples 

All of the clinical tissue specimens studied herein were 
obtained from 80 patients of 57-84 years old by prostate- 
ctomy, transurethral resection of prostate (TURP) or biop- 
sies. The patients were classified as 20 cases of BPH, 20 
cases of PIN, 40 cases of primary Pea, including 9 patients 



with recurrent Pea and a history of androgen ablation 
therapy (orchiectomy and/or hormonal therapy), who 
were referred to as androgen-independent prostate can- 
cers. Eight specimens were harvested from these andro- 
gen-independent Pea patients prior to androgen ablation 
treatment. Each tissue sample was cut into two parts, one 
was fixed in 10% formalin for IHC and the other treated 
with 4% paraformaldehyde/0.1 M PBS PH 7.4 in 0.1% 
DEPC for 1 h for ISH analysis, and then embedded in par- 
affin. All paraffin blocks examined were then cut into 5 
urn sections and mounted on the glass slides specific for 
IHC and ISH respectively in the usual fashion. H&E- 
stained section of each Pea was evaluated and assigned a 
Gleason score by the experienced urological pathologist at 
our institution based on the criteria of Gleason score [2]. 
The Gleason sums are summarized in Table 1. Clinical 
staging was performed according to Jewett-whitmore- 
prout staging system, as shown in Table 2. In the category 
of PIN, we graded the specimens into two groups, i.e. low 
grade PIN (grade I - II) and high grade PIN (HGP1N, 
grade III) on the basis of literatures |3,4J. 

Immunohistochemicat (IHC) analysis 
Briefly, tissue sections were deparaffinized, dehydrated, 
and subjected to microwaving in 10 mmol/L citrate 
buffer, PH 6.0 (Boshide, Wuhan, China) in a 900 W oven 
for 5 min to induce epitope retrieval. Slides were allowed 
to cool at room temperature for 30 min. A primary mouse 
antibody specific to human PSCA (Boshide, Wuhan, 
China) with a 1:100 dilution was applied to incubate with 
the slides at room temperature for 2 h. Labeling was 
detected by sequentially adding biotinylated secondary 
antibodies and strepavidin-peroxidase, and localized 
using 3,3'-diaminobenzidine reaction. Sections were then 
counterstained with hematoxylin. Substitution of the pri- 
mary antibody with phosphate-buffered-saline (PBS) 
served as a negative-staining control. 

mRNA in situ hybridization (ISH) 

Five-um-thick tissue sections were deparaffinized and 
dehydrated, then digested in pepsin solution (4 mg/ml in 
3% citric acid) for 20 min at 37.5 °C, and further proc- 
essed for ISH. Digoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, 
Wuhan, China) were hybridized to the sections at 48 °C 
overnight. The posthybridization wash with a high strin- 
gency was performed sequentially at 37° C in 2 x standard 
saline citrate (SSC) for 10 min, in 0.5 x SSC for 15 min 
and in 0.2 x SSC for 30 min. The slides were then incu- 
bated to biotinylated mouse anti-digoxigenin antibody at 
37.5*Cforl h followed by washing in 1 x PBS for 20 min 
at room temperature, and then to strepavidin-peroxidase 
at 37.5°C for 20 min followed by washing in 1 x PBS for 
15 min at room temperature. Subsequently, the slides 
were developed with diaminobenzidine and then coun- 
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Table 1: Correlation ofPSCA expression with Gleason score 






Intensity x frequency 


Gleason score 


0-6 (%) 


9(%) 


2-4 
5-7 

a-io 


5(83) 
19(79) 
S(28) 


1(17) 
5(21) 
13(72) 




Table 2: Correlation of PSCA expression with clinical stage 


Intensity * frequency 


Tumor stage 


0-6 (%) 


9(56) 


2>C 


27 (67.5) 
2(25) 


13(32.5) 
6(75) 



terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. All slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA 
immunostaining and mRNA in situ hybridization, the 
same scoring manners are taken in the present study for 
PSCA protein staining by IHC and PSCA mRNA staining 
by ISH. Each slide was read and scored by two independ- 
endy experienced urological pathologists using Olympus 
BX-41 light microscopes. The evaluation was done in a 
blinded fashion. For each section, five areas of similar 
grade were analyzed semiquantitatively for the fraction of 
cells staining. Fifty percent of specimens were randomly 
chosen and rescored to determine the degree of interob- 
server and intraobserver concordance. There was greater 
than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopi- 
cally was graded on a scale of 0 to 3+ with 3 being the 
highest expression observed (0, no staining; 1+, mildly 
intense; 2+, moderately intense; 3+, severely intense). The 
staining density was quantified as the percentage of cells 
staining positive for PSCA with the primary antibody or 
hybridization probe, as follows: 0 = no staining; 1 = posi- 
tive staining in <25% of the sample; 2 =» positive staining 
in 25%-50% of the sample; 3 = positive staining in >50% 



of the sample. Intensity score (0 to 3+) was multiplied by 
the density score (0-3) to give an overall score of 0-9 
[1,5]. In this way, we were able to differentiate specimens 
that may have had focal areas of increased staining from 
those that had diffuse areas of increased staining [6]. The 
overall score for each specimen was then categorically 
assigned to one of the following groups: 0 score, negative 
expression; 1-2 scores, weak expression; 3-6 scores, mod- 
erate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expres- 
sion in BPH, PIN and Pea tissues were compared using the 
Chi-square and Student's c-test. Univariate associations 
between PSCA expression and Gleason score, clinical 
stage and progression to androgen- independence were 
calculated using Fisher's Exact Test. For all analyses, p < 
0.05 was considered statistically significant. 

Results 

PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative (composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 
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2) in one and negative (composite score 0) in the other. 
PSCA expression was localized to the basal and secretory 
epithelial cells, and prostatic stroma was almost negative 
staining for PSCA protein and mRNA in all cases 
examined. 

PSCA expression in PIN 

In this study, we detected weak or negative expression of 
PSCA protein and mRNA (£2 scores) in 7 of 9 (77.8%) 
low grade PIN and in 2 of 11 (18.2%) HGPIN, and mod- 
erate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate 
PSCA mRNA expression (6 score) was found weak stain- 
ing for PSCA protein (2 score) by IHC. Strong PSCA pro- 
tein and mRNA expression (9 score) were detected in the 
remaining 3 of 1 1 (27.3%) HGPIN. There was a statisti- 
cally significant difference of PSCA protein and mRNA 
expression levels observed between HGPIN and BPH (p < 
0.05), but no statistical difference reached between low 
grade PIN and BPH (p > 0.05). 

PSCA expression in Pea 

In order to determine if PSCA protein and mRNA can be 
detected in prostate cancers and if PSCA expression levels 
are increased in malignant compared with benign glands, 
Forty-eight paraffin-embedded Pea specimens were ana- 
lysed by IHC and ISH. It was shown that 19 of 48 (39.6%) 
Pea samples stained very strongly for PSCA protein and 
mRNA with a score of 9 and another 21 (43.8%) speci- 
mens displayed moderate staining with scores of 4-6 (Fig- 
ure 1). In addition, 4 specimens with moderate to strong 
PSCA mRNA expression (scores of 4-9) had weak protein 
staining (a score of 2) by IHC analyses. Overall, Pea 
expressed a significandy higher level of PSCA protein and 
mRNA than any other specimen category in this study (p 
< 0.05, compared with BPH and PIN respectively). The 
result demonstrates that PSCA protein and mRNA are 
overexpressed by a majority of human Pea, 

Correlation of PSCA expression with Gleason score in Pea 
Using the semi-quantitative scoring method as described 
in Materials and Methods, we compared the expression 
level of PSCA protein and mRNA with Gleason grade of 
Pea, as shown in Table l. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentia- 
tion, 5-7 scores = moderate-differentiation and 8-10 
scores = poor-differentiation [7 J. Seventy-two percent of 
Gleason scores 8-10 prostate cancers had very strong 
staining of PSCA compared to 21% with Gleason scores 
5-7 and 17% with 2-4 respectively, demonstrating mat 
poorly differentiated Pea had significantly stronger 
expression of PSCA protein and mRNA than moderately 
and well differentiated tumors (p < 0.05). As depicted in 
Figure 1, IHC and ISH analyses showed that PSCA protein 
and mRNA expression in several cases of poorly differen- 



tiated Pea were particularly prominent, with more intense 
and uniform staining. The results indicate that PSCA 
expression increases significandy with higher tumor grade 
in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we 
showed the results in Table 2. Seventy-five percent of 
locally advanced and node positive cancers (i.e. C-D 
stages) expressed statistically high levels of PSCA versus 
32.5% that were organ confined (i.e. A-B stages) (p < 
0.05). The data demonstrate that PSCA expression 
increases significantly with advanced tumor stage in 
human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

All 9 specimens of androgen-independent prostate can- 
cers stained positive for PSCA protein and mRNA. Eight 
specimens were obtained from patients managed prior to 
androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the 
strongest staining category (score = 9), compared with 
three out of eight (37.5%) of patients with androgen- 
dependent cancers (p < 0.05). The results demonstrate 
that PSCA expression increases significantly with progres- 
sion to androgen-independence of human Pea. 

It is evident from the results above that within a majority 
of human prostate cancers the level of PSCA protein and 
mRNA expression correlates significandy with increasing 
grade, worsening stage and progression to androgen-inde- 
pendence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the 
results of PSCA IHC staining with mRNA ISH analysis. 
Positive staining areas and its intensity and density scores 
evaluated by IHC were identical to those seen by ISH in 79 
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/ 
48 Pea respectively). Importantly, 27/27 samples with 
PSCA mRNA composite scores of 0-2, 32/36 samples 
with scores of 3-6 and 22/24 samples with a score of 9 
also had PSCA protein expression scores of 0-2, 3-6 and 
9 respectively. However, in 5 samples with PSCA mRNA 
overall scores of 3-6 and in 2 with scores of 9 there were 
less or negative PSCA protein expression (i.e. scores of 0- 
4), suggesting that this may reflect posttranscriptional 
modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data 
demonstrate that the results of PSCA immunostaining 
were consistent with those of mRNA ISH analysis, show- 
ing a high degree of correlation between PSCA protein 
and mRNA expression. 
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Figure I 

Representatives of PSCA IHC and ISH staining in Pea (A. IHC staining, B. ISH staining, x200 magnification). A h B,: negative con- 
trol of IHC and ISH. PBS replacing the primary antibody (A,) and hybridization with a sense PSCA probe (Bj) showed no back- 
ground staining. A 2 , Bj: a moderately differentiated Pea (Gleason score = 3+3 = 6) with moderate staining (composite score = 
6) in all malignant cells; A 2 : IHC shows not only cell surface but also apparent cytoplasmic staining of PSCA protein. A 3 , B 3 : a 
poorly differentiated Pea (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) in all malignant cells. 
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Discussion 

PSCA is homologous to a group of cell surface proteins 
that mark the earliest phase of hematopoietic develop- 
ment. PSCA mRNA expression is prostate-specific in nor- 
mal male tissues and is highly up-regulated in both 
androgen-dependent and-independeni Pea xenografts 
(IAPC-4 tumors). We hypothesize that PSCA may play a 
role in Pea tumorigenesis and progression, and may serve 
as a target for Pea diagnosis and treatment. In this study, 
IHC and ISH showed that in general there were weak or 
absent PSCA protein and mRNA expression in BPH and 
low grade PIN tissues. However, PSCA protein and mRNA 
are widely expressed in HGP1N, the putative precursor of 
invasive Pea, suggesting that up-regulation of PSCA is an 
eady event in prostate carcinogenesis. Recently, Reiter RE 
et al [1 ], using ISH analysis, reported that 97 of 1 18 (82%) 
HGPIN specimens stained strongly positive for PSCA 
mRNA. A very similar finding was seen on mouse PSCA 
(mPSCA) expression in mouse HGPIN tissues by Tran C. 
P et al [8]. These data suggest that PSCA may be a new 
marker associated with transformation of prostate cells 
and tumorigenesis. 

We observed that PSCA protein and mRNA are highly 
expressed in a large percentage of human prostate cancers, 
including advanced, poorly differentiated, androgen- 
independent and metastatic cases. Fluorescence-activated 
cell sorting and confocal/ immuno fluorescent studies 
demonstrated cell surface expression of PSCA protein in 
Pea cells [9). Our IHC expression analysis of PSCA shows 
not only cell surface but also apparent cytoplasmic stain- 
ing of PSCA protein in Pea specimens (Figure 1 ). One pos- 
sible explanation for this is that anti-PSCA antibody can 
recognize PSCA peptide precursors that reside in the cyto- 
plasm. Also, it is possible that the positive staining that 
appears in the cytoplasm is actually from the overlying 
cell membrane [5 J. These data seem to indicate that PSCA 
is a novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression 
correlates with high grade (i.e. poor differentiation), 
increased tumor stage and progression to androgen-inde- 
pendence of Pea. These findings support the original IHC 
analyses by Gu 2 et al [9], who reported that PSCA protein 
expressed in 94% of primary Pea and the intensity of 
PSCA protein expression increased with tumor grade, 
stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al 
[10], in which the significant association between high 
PSCA expression and adverse prognostic features such as 
high Gleason score, seminal vesicle invasion and capsular 
involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recur- 
rence, clinical progression or survival of Pea. Hara H et al 
(11] used RT-PCR detection of PSA, PSMA and PSCA in 1 



ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed that among 58 PCa 
patients, each PCR indicated the prognostic value in the 
hierarchy of PSCA>PSA>PSMA RT-PCR, and extraprostatic 
cases with positive PSCA PCR indicated lower disease-pro- 
gression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic fac- 
tor. Dubey P et al |12] reported that elevated numbers of 
PSCA + cells correlate positively with the onset and devel : 
opment of prostate carcinoma over a long time span in 
the prostates of the TRAMP and PTEN +/- models com- 
pared with its normal prostates. Taken together with our 
present findings, in which PSCA is overexpressed from 
HGPIN to almost frank carcinoma, it is reasonable and 
possible to use increased PSCA expression level or 
increased numbers of PSCA-positive ceils in the prostate 
samples as a prognostic marker to predict the potential 
onset of this cancer. These data raise the possibility that 
PSCA may have diagnostic utility or clinical prognostic 
value in human Pea. 

The cause of PSCA overexpression in Pea is not known. 
One possible mechanism is that it may result from PSCA 
gene amplification. In humans, PSCA is located on chro- 
mosome 8q24.2 [1], which is often amplified in meta- 
static and recurrent Pea and considered to indicate a poor 
prognosis 113-15]. Interestingly, PSCA is in close proxim- 
ity to the c-myc oncogene, which is amplified in >20% of 
recurrent and metastatic prostate cancers [16,17]. Reiter 
RE et al 1 18] reported that PSCA and MYC gene copy num- 
bers were co-amplified in 25% of tumors (five out of 
twenty), demonstrating that PSCA overexpression is asso- 
ciated with PSCA and MYC coamplification in Pea. Gu Z 
et al [9] recently reporteted that in 102 specimens availa- 
ble to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had iden- 
tically positive areas of PSCA protein and mRNA expres- 
sion. Taken together with our findings, in which we 
detected moderate to strong expression of PSCA protein 
and mRNA in 34 of 40 (85%) Pea specimens examined 
simultaneously by IHC and ISH analyses, it is demon- 
strated that PSCA protein and mRNA overexpressed in 
human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its 
mRNA. 

At present, the regulation mechanisms of human PSCA 
expression and its biological function are yet to be eluci- 
dated. PSCA expression may be regulated by multiple fac- 
tors [18], WatabeTet al (19] reported that transcriptional 
control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be 
regulated or mediated through cell-cell contact and pro- 
tein kinase C (PKC) [20]. Homologues of PSCA have 
diverse activities, and have themselves been involved in 
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carcinogenesis. Signalling through SCA-2 has been dem- 
onstrated to prevent apoptosis in immature thymocytes 
[21]. Thy-1 is involved in T cell activation and transducts 
signals through src-like tyrosine kinases [22]. Ly-6 genes 
have been implicated both in tumorigenesis and in cell- 
cell adhesion [23-25]. Cell-cell or cell-matrix interaction is 
critical for local tumor growth and spread to distal sites. 
From its restricted expression in basal cells of normal 
prostate and its homology to SCA-2, PSCA may play a role 
in stem/progenitor cell function, such as self-renewal (i.e. 
anti-apoptosis) and/or proliferation [1]. Taken together 
with the results in the present study, we speculate that 
PSCA may play a role in tumorigenesis and clinical pro- 
gression of Pea through affecting cell transformation and 
proliferation. From our results, it is also suggested that 
PSCA as a new cell surface antigen may have a number of 
potential uses in the diagnosis, therapy and clinical prog- 
nosis of human Pea. PSCA overexpression in prostate 
biopsies could be used to identify patients at high risk to 
develop recurrent or metastatic disease, and to discrimi- 
nate cancers from normal glands in prostatectomy sam- 
ples. Similarly, the detection of PSCA-overexpressing cells 
in bone marrow or peripheral blood may identify and pre- 
dict metastatic progression better than current assays, 
which identify only PSA-positive or PSMA-positive pros- 
tate cells. 

In summary, we have shown in this study that PSCA pro- 
tein and mRNA are maintained in expression from 
HGPIN through all stages of Pea in a majority of cases, 
which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell dif- 
ferentiation), advanced stage and androgen -independent 
progression. PSCA protein overexpression is due to the 
up regulation of its mRNA transcription. The results sug- 
gest that PSCA may be a promising molecular marker for 
the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of mis tumor. 
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column "DNA Id" identifies the particular lot of PCR product used. The microarray experiments 
were performed using well-established and accepted microarray techniques known in the art 
{See, e.g., Nature Revs. Genetics, 5:229-237 (2004), attached as Exhibit C). The DNA samples 
used in the microarray studies were obtained from individual lung tumor tissue samples or 
individual normal lung tissue samples. The individual tumor and normal lung samples were each 
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compared to pooled samples of normal epithelial tissue. The level of expression in the lung 
tumor or normal lung tissue sample was compared to the normal pooled epithelial sample, and 
reported as a raw ratio. The average of the normal lung samples was then used to normalize the 
data to generate a ratio of expression of the PR0539 gene in lung tumor samples compared to the 
average expression in normal lung tissue. In the results reported in Exhibit B, a ratio of 2.0 or 
greater is a significant result, and indicates a significant increase in expression of the PR0539 
gene in lung tumor tissue compared to the normal lung tissue controls. 

5. The results of the microarray studies reported in Exhibit B indicate that the gene 
encoding PR0539 (DNA47465) is significantly overexpressed in eight of the twenty-six lung 
tumor samples tested compared to the normal lung tissue controls. That is the equivalent of 
nearly one in every three samples (31%). In addition, four out of five squamous cell lung 
carcinomas (80%) are significantly overexpressed (shown in bold). In contrast, only one of the 
seven individual normal lung tissue samples shows significant overexpression of the PR0539 
gene (14%). 

6. It is well-established in the art that overexpression of the mRNA for a gene is 
likely to lead to overexpression of the corresponding protein. Support for this statement can be 
found, for example, in the Molecular Biology of the Cell, a leading textbook in the field. (Bruce 
Alberts, et aL, Molecular Biology of the Cell (4 th ed. 2002), excerpts submitted herewith as 
Exhibit D). Figure 6-3 on page 302 illustrates the basic principle that there is a correlation 
between increased gene expression and increased protein expression. The accompanying text 
states that "a cell can change (or regulate) the expression of each of its genes according to the 
needs of the moment - most obviously by controlling the production of its mRNA." Molecular 
Biology of the Cell at 302, emphasis added. Similarly, figure 6-90 on page 364 illustrates the 
path from gene to protein. The accompanying text states that while potentially each step can be 
regulated by the cell, "the initiation of transcription is the most common point for a cell to 
regulate the expression of each of its genes." Molecular Biology of the Cell at 364. This point is 
repeated on page 379, where the authors state that of all the possible points for regulating protein 
expression, "[f]or most genes transcriptional controls are paramount." Molecular Biology of the 
Cell jit 379. . 

7. While not every lung tumor sample tested shows overexpression of the PR0539 
gene, the data in Exhibit B indicate that a significant portion of lung tumors do (30% of all lung 
tumors tested and 80% of squamous cell tumors tested), while only one of the normal lung tissue 
samples shows overexpression (14%). Given the known correlation between overexpression of a 
gene and the corresponding overexpression of the encoded protein, it is very likely that a similar 
number of lung tumors will overexpress the PR0539 protein, while very few normal lung tissue 
samples likely will. Together with the data reported in Example 16 that the gene encoding 
PR0539 is amplified in some lung tumors, including squamous cell lung carcinoma, the results 
reported m Exhibit B indicate that the PR0539 gene and protein, as well as antibodies to the 
encoded protein, can be used to differentiate some cancerous lung tissue, particularly squamous 
cell carcinoma, from normal lung tissue. Because not all lung tumors show overexpression of 
PR0539, it cannot be used to exclude a sample being tested as non-cancerous. However, the 
PR0539 gene, protein, and corresponding antibodies are useful as a diagnostic tool for lung 
cancer, particularly squamous cell carcinoma, since a very high percentage of squamous cell lung 
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EXHIBIT A 



VICTORIA SMITH 



Genentech Inc. 
Dept. Molecular Biology 
1 DN A Way- 
South San Francisco CA 94080 
Ph: (650) 225 7382 
Fax: (650) 225 6497 
Email: victoria@g ene.cnm 

Education 



Ph.D. (1991) Molecular Biology, University of Cambridge, Cambridge, United 
Kingdom. 

Honors (1987) Biochemistry, University of Western Australia, Australia. 
Bachelor of Science (1986) Physical and Inorganic Chemistry, and Biochemistry, 
University of Western Australia, Australia. 



Work and Research experience 



Senior Scientist, Genentech Inc (August 1996 - present; promoted to Senior Scientist 
March 2001) 

Lab head, Dept. Molecular Biology 

- Identification of potential therapeutic targets for cancer using novel microarray 
technology 

- Discovery and identification of novel secreted proteins 

- Development of cancer therapuetics 

Stanford University, California, U.S.A. (February 1995 - August 1996) 
Research Fellow, Department of Biochemistry 

Research: Genomic functional analysis of chromosome V of Saccharomyces cerevisiae 

Stanford University, California, U.S. A (January 1992 - January 1995) 
Postdoctoral Fellow, Department of Genetics. 
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Research: Development of new methodology for whole genome functional analysis in 
microorganisms using genomic sequence data and insertional mutagenesis. 
Awards 

Human Frontiers Science Program Organization Long Term FeUowship (accepted, 
4/01/93-1/31/95). 

American Cancer Society (California Division) FeUowship (1993, declined). 

Cambridge University, United Kingdom {October 1988 - December 1991) 
Research undertaken at the Medical Research Council Laboratory of Molecular Biology, 
Cambridge, UK, for the degree of Doctor of Philosophy, Cambridge University. 
Thesis: A Molecular Genetic Analysis of Yeast Chromosome DC 
Thesis Advisor: Dr. Barclay BarreU. 

Awards 

Max Perutz Prize in 1991 for outstanding performance by a graduate student. Awarded 
for advances in genomic-scale DNA sequencing methodology, and genetic analysis of 
the SNP1 gene of Saccharomyces cerevisiae . 

King Edward Memorial Hospital for Women, Western Australia (1988) 
Research Scientist. 

Research: Analysis of hormone-inducible mRNAs in breast tumors. 
University of Western Australia (1984 - 1987) 

1984 - 1986: Bachelor of Science degree with double major in Biochemistry and Physical 
and Inorganic Chemistry. 

1987: First Class Honors in Biochemistry. Thesis: Nuclease Sensitivity and Methylation 
Patterns of die Phenylalanine Hydroxylase Gene. 

Awards 

Association of Commonwealth Universities Scholarship for study in the United 
Kingdom (accepted, October 1988 - October 1991, University of Cambridge). 
Hackett Scholarship for overseas study (1988, declined). 

Lady James Prize in Natural Science in 1986: Best student completing Bachelor of 
Science degree with a major in a natural science. 

JWH Lugg Prize in Biochemistry in 1986: Best student completing major in 
Biochemistry. 
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Convocation Prize in Science in 1985: Best student completing major in second year 
Physics, Geology or Chemistry. 

Shell Prize in Chemistry in 1985: Best student completing major in second year 
Chemistry. 



Publications 

^ Smim - ER Shen ' D - Wieand, T.H. Landon, NLA. Wong, A.M. Lessells, S. Paterson- 
Brown, T.D. Wu, J.Z. Tang, KJ. Hillan, I.D. Penman, Expression Analysis of the 
Metaplasia-Dysplasia-Carcinoma Sequence in Barrett's Esophagus (submitted). 

Tice DA. Szeto W. Soloviev I. Rubinfeld B. Fong SE. Dugger DL. Winer J. Williams PM. 
Wieand D. Smith V . Schwall RH. Pennica D. Polakis P. Journal of Biological Chemistry 
277(16):14329-35, 2002 Apr 19. 

N.J. Maughan, F. Lewis, V.Smith (2001) Journal of Pathology 195, 3-6. 

F. Lewis, N.J. Maughan, V.Smith, K. Hillan, P. Quirke (2001) Journal of Pathology 195, 66- 

D.J. Garfinkel, M.J. Curcio and V.Smith (1998) Ty Mutagenesis Methods in Microbiology, 
volume 26, 101-117. 

G Churcher, S. Bowman, K. Badcock, A. Bankier, D. Brown, T. Chillingworth, R. 
Connor, K. Devlin, S. Gentles, N. Hamlin, D. Harris, T. Horsnell, S. Hunt, K. Jagels, M. 
Jones, G. Lye, S. Moule, C Odell, D. Pearson, M. Rajandream, P. Rice, N. Rowley, J. 
Skelton, V. Smith, S. Walsh, S. Whitehead & B. Barrell. (1997) Nature, 387, 84-87. ' 

F. S. Dietrich, J. Mulligan, K. Hennessy, M. A. Yelton, E. Allen, R. Araujo, E. Aviles, A. 
Berno, T. Brennan, J. Carpenter, E. Chen, J. M. Cherry, E. Chung, M. Duncan, E. 
Guzman, G. HartzeU, S. Hunicke-Smith, R. W. Hyman, A. Kayser, C. Komp, D. 
Lashkari, H. Lew, D. Lin, D. Mosedale, K. Nakahara, A. Namath, R. Norgren, P. Oefner, 
C. Oh, F.X. Petel, D. Roberts, P. Sehl, S. Schramm, T. Shogren, V. Smith. P. Taylor, Y. 
Wei, D.Botstein&R. W.Davis. (1997) Nature 387, 78-81. 
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V. Smith, K. Chou, D. Lashkari, D. Botstein, and P. O. Brown. (1996). Functional 
Analysis of the Genes of Yeast Chromosome V by Genetic Footprinting. Science 274 
2069-74 

V. Smith, D. Botstein, and P. O. Brown (1995) Genetic Footprinting: A genomic strategy 
for detennining a gene's function given its sequence. Proc. Natl Acad. Sci. U.SA., 92, 
6479-6483. 

v - Smith r M. Craxton, A. T. Bankier, C. M. Brown, W. D. Rawlinson, M. Chee, and B. G. 
Barrell (1995) Microtiter methods for the preparation and fluorescent sequencing of M13 
clones. Recombinant DNA Methodology H: a volume in the Selected Methods in Enzymology 
Series, pp. 607 - 621. 

V. Smith, M. Craxton, A. T. Bankier, C. M. Brown, W. D. Rawlinson, M S. Chee, and B. 
G. Barrell (1993) Microtiter methods for the preparation and fluorescent sequencing of 
M13 clones. Methods in Enzymology, 218 , 173-187. 

V- Smith and M - S. Chee (1991) A simple method for sequencing the complementary 
strand of ssDNA from M13 clones. Nucleic Acids Research 19, 6957. 

V- Smith B. G. Barrell (1991) Cloning of a Yeast Ul snRNP 70K protein homologue: 
functional conservation of an RNA binding domain between humans and yeast. EMBO 
/ournallO, 2627-2643. 

W. D. Rawlinson, M. S. Chee, V.Smith and B. G. Barrell (1991) Preparation of large 
numbers of single stranded DNA templates by rescue from phagemids in microtiter 
plates. Nucleic Acids Research 19, £779. 

V.Smith, C. M. Brown, A. T. Bankier, and B. G. Barrell (1990) Semi-automated 
preparation of DNA templates for large scale sequencing projects. DNA Sequence 1, 73- 
78. 

S. J. Wysocki, E. Hahnel, S. P. Wilkinson, V. Smith, and R. Hahnel (1990) Hormone- 
sensitive gene expression in breast tumors. Anticancer Research 10, 185-188 
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S. J. Wysocki, E. Hahnel, A. Masters, V. Smith, A. J. McCartney and R. Hahnel (1990) 

Detection of pS2 messenger RNA in gynecological cancers. Cancer Research 50 1800- 
1802 



Patents 



Genetic Footprinting: Insertional Mutagenesis and Genetic Selection. U.S. Patent No. 
5,612,180. Inventors: Patrick Brown and Victoria Smith 

PATENTS FILED ( at Genentech Inc.) 



Methods of Detecting and Quantifying Gene Expression. Inventors: Victoria Smith, 
Edward Robbie, David Lowe, James Marsters. 

Compositions and Methods for the Treatment of Cancer. Inventors: Victoria Smith, 
Austin Gurney, Audrey Goddard, Fred DeSauvage 

Diagnostic for Dysplasia in Barrett's Esophagus. Inventor: Victoria Smith 

Numerous composition of matter filings related to novel gene discovery, pending 
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EXHIBIT B 



Unqld DNAId 
Lung Tumor Samples 



Experiment Name 



Raw Ratio 
(sample/pooled 
epithelial) 



Normalized Ratio 
(sample/normal lung) 



340. 208,278. lung tumor R1 057 vs epi pool 

340. 208,278. lung tumor R1 094 vs epi pool 

340. 208,278. lung tumor R1 372 vs epi pool 

340. 208,278. lung tumor R1417 vs epi pool 

340. 208,278. lung tumor R1431 vs epi pool 

340. 208,278. lung tumor R419 vs epi pool 

340. 208,278. lung tumor R542 vs epi pool 

340. 208,278. lung tumor R544 vs epi pool 

340. 208,278. lung tumor R685vs epi pool 

340. 208,278. lung tumor R688 vs epi pool 

340. 208,278. lung tumor R693 vs epi pool 

340. 208,278. lung tumor R737 vs epi pool 

340. 208,278. lung tumor R742 vs epi pool 

340. 295,413. lung tumor 

340. 208,278. lung tumor \R543 vs epi pool 

340. 208,278. Iungtumdr1604/REF.RNA 

340. 208,278. lung tumor1683 

340. 208,278. lung tumor1853 

340. 208,278. lung tumor2454/ref.RNA 

340. 208,278. LungCaSq-hf1602 

340. 208,278. LungCaSq-hf1647 ' 

340. 208,278. LungSqCa-hf1293 

340. 208,278. LungSqCa-hf1646 

340. 208,278. LungSqCa-hf? 

340. 208,278. 1 0OngLungBaCal 069 vs 25ngEpi1 409 

340. 208,278. 100ngLungBaCa994 vs 25ngEpi1409 



3.429 
1.716 
0.046 
1.106 
0.242 
1.413 
0.429 
0.735 
0.247 
0.897 
0.755 
1.508 
6.313 
3.213 
0.402 
1.06 

11.523 
0.067 

11.028 
1.103 
2.794 
4.728 
5.399 
6.306 
0.376 
1.706 



2.463 
.1.232 
0.033 
0.794 
0474 
1.015 
0.308 
0.528 
0.177 
0.644 
0.542 
1.083 
0.225 
2.307 
0.289 
0.761 
8.275 
0.048 
7.920 
0.792 
2.007 
3< 396 
3.877 
4.529 
0.270 
1.225 



Normal Luno Samples 



340. 
340. 
340. 
340. 
340. 
340. 
340. 



208,278. N. lung R1415 vs univ.ref 
208,278. N. lung R1431 vs ref. 
208,278. N. lung R417 vs univ. ref 
208,278. 1054LunglnflamTA2 
208,278. 1415LunglnflamTA2 
208,278. 423LunglnflamTA2 
208,278. 488-552lnflmdLungTA2 



0.433 
0.567 
1.372 
1.585 
1.408 
3.709 
0.673 



0.311 
0.407 
0.985 
1.138 
1.011 
2.664 
0.483 
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